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(57) Abstract: A process of producing a transgenic multi-cellular plants or parts thereof expressing a trait of interest, said trait 
having a controlled distribution of said trait to progeny, wherein said process comprises (i) producing a first plant or a cell thereof 
having in a first locus of a nuclear chromosome a first heterologous nucleotide sequence comprising a first fi:agment of a nucleotide 
sequence encoding said trait of interest, (ii) producing a second plant or a cell thereof having in a second locus of a nuclearchomosome 
homologous to said nuclear chromosome of step (i), a second heterologous nucleotide sequence comprising a second fragment of the 
nucleotide sequence encoding said trait of interest, and (iii) hybridising said first and said second plant or cells thereof to generate 
progeny exhibiting said functional trait of interest due to binding between a protein or polypeptide encoded by said firrst heterologous 
nucleotide sequence and a protein or polypeptide encoded by said second heterologous nucleotide sequence. Further, the invention 
provides a process of producing hybrid seeds for agriculture. 
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Transgenic plants with controlled distribution of a trait to progeny 

FIELD OF THE INVENTION 

The present invention relates to a process of producing hybrid seeds. The Invention also 
relates to a process of producing a transgenic multicellular plant organism expressing a trait of 
interest and having a controlled distribution of said trait to progeny or to other organisms. The 
invention also relates to a pnacess of producing a transgenic multicellular plant organism 
expressing two traits of interest, whereby said traits have a controlled distribution to progeny. 
Preferably, one of said traits is male sterility. Moreover, the Invention relates to a process of 
producing hybrid seeds, notably for agricultural purposes. The Invention further relates to a 
plant expressing a trait, whereby the distribution of said trait to progeny is controlled. I.e. the 
probability of transferring said trait to illicit progeny, notably by cross-pollination. Is very low. 

BACKGROUND OF THE INVENTION 

The commercial use of genetically engineered crop species has caused concerns about 
the possible transfer of transgenes and traits encoded by transgenes from genetically modified 
plants (GM plants) into landraces, wild relatives or other non-GM plant varieties or related crop 
species (Ellstrand. N. C, 2001 , Plant Physiol. 125. 1 543-1545; Quist & Chapela, 2001 . Nature, 
414. 541-543), which could change the ecological balance in tiie affected ecosystems or lead 
to otiier, first of all, socioeconomic problems. Additionally, fliere Is a certain fear tiiat 
transgenes, especially antibiotic resistance genes used as transfomnatidn maricers. can escape, 
through so-called horizontal transfer, into surrounding microorganisms (Chiter et a/.. 2000, 
FEBS Lett., 481 . 164-168), thus modifying the microflora In an undesirable way. 

Altiiough many of tiiese wonies are not well justified scientifically (Christou. P.. 2002, 
Transgenic Res., 11, lii-v), ttie creation of safe and continolled transgene management systems 
Is highly desirable, as it might prevent potential problems In ttie future and shall help to protect 
tiie gemiplasm of existing plant species in tiie most efficient way. In addition, tiiere are 
problems caused by contamination of organically grown crops or non-GM crops with transgenic 
cultivars. This has a serious impact on tine mari<eting of t^nsgenic as well as non-transgenic 
crops, an Issue which cannot be ignored by producers. 
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Unlike other products generated by humans, products created by biotechnology are 
potentially self-replicating machines. Therefore, any transgenic nnateriat created by current 
technology and released into the environment has a potential of persisting there for a very long 
time. Common practice of plant genetic engineering is based on the use of expression 
cassettes and vectors that contain continuous coding sequence for the gene of interest Such 
expression cassettes are integrated into a host chromosome and upon hybridization or another 
genetic information exchange t>etween a GM plant and another organism, whether licit or illicit, 
the expression cassette is transmitted with a high probability to the progeny or another recipient 
as a functional transcriptional unit 

WOOO/52146 describes general ideas for encrypting a trait of interest by splitting 
gene(s) in two or more fragments and rejoining the fragments by trans-splicing after mating 
parental organisms, whereby the parental organisms provide said fragments. WOOO/52146 
does not go beyond general ideas, it does not contain an enabling disclosure on how these 
ideas can be reduced to practice. Notably, it does not contain an example. WOOO/71701 
describes assembly of a functional protein by intein-mediated protein trans-splicing/interaction 
for improving containment of a transgene encoding said protein. WOOO/71701 does not 
describe bringing together fragments of a protein by mating parent organisms. Further, the 
frequency of transmission of transgene according to WOOO/71701 is not sufficiently low for 
large scale applications like agriculture, notably when a transgene provides a selective 
advantage. 

wool 1 6287 relates to the creation of allelic position for transgenes, whose expression 
determines a phenotype, with the aim that the transgenes segregate to different gametes. This 
patent application does not address the problem of controlling movement of transgenes, but 
rather faiait generation, specifically male-sterility, encoded by at least two transgenes. Furtiner, 
it does not mention intein-mediated trans-splicing. Moreover, this application does not describe 
control over trait movement by splitting a trait-encoding gene in two or more fragments. 

Trait assembly from parts encoding the trait is not of high value without knowing how to 
achieve the most favorable positions of the encoding fragments in practically the most feasible 
way, in order to provide the stiictest control over undesired transmission of said trait. For large 
scale applications like for agriculture, biological safety requires that undesired transmission of 
a transgene is reduced to a frequency of practically zero. 
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Crop plants expressing as a trait of interest male or female sterility are widely used for 
hybrid seed production. Hybrid crops have on average 20% yield advantage over inbred 
varieties and production of hybrid seeds is a large industry. Many different technologies are 
used to produce hybrid seeds (for review see: Perez-Prat E. & van Lookeren Campagne. MM, 
2002. Trends Plant ScL, T, 199-202). These technologies can be conditionally divided into at 
least four groups according to the pollination control mechanism: mechanical, chemical, genetic 
and transgenic. However, one critical requirement is common for all these technologies: ideally, 
a 100% male sterile line should be used for the hybridization process arid 100% male fertility 
restoration in F, progeny should be achieved. Such stringent requirements are absolutely 
necessary for prtxlucing hybrid seeds free of contamination with selfed seeds. 

The current methods of hybrid seed production are unsatisfactory in the above respect. 
These processes are either expensive, as in the case of mechanical de-tasselling (castration) 
of com, or "leaky" as in the case of genetic approaches or both as in the case of chemical 
treatment-based method (e.g. US4569688). 

Genetic approaches preferably include the use of lines with cytoplasmic male sterility 
(CMS) mutants and fertility restorers (e.g. WO02098209). Transgenic approaches use 
predominantly plants with genetically engineered nuclear male sterility (NMS) or CMS and 
ferUllty restoration In progeny (WO8910396; US5530191: US6255564; W09832325; 
WO9201 799; US63921 1 9; WCX)1 1 6287). These approaches also require the use of a so-called 
maintalner line in order to propagate and maintain the male-sterile line. 

The transgenic systems built on one transgene providing for male sterility and another 
transgene canying the function of restoring male fertility (e.g. US62555640) guarantee neither 
complete restoration of male fertility in hybrid progeny nor complete elimination of potentially 
negative effects of the transgene providing for male sterility on the general health of said 
progeny. In other words these systems are leaky. In addition, none of the systems mentioned 
above offers a convenient way of producing and maintaining the male-sterile line. This is an 
Important element of any genetically engineered system for hybrid seed production, as the 
successful application of such a system for large-scale production depends on whether the 
male-sterile female parent line can be propagated in an economical and efficient way. In other 
words, currently there is no universal, reliable and economical system for hybrid seed 
production, which integrates all requirements necessary for maintenance of the original lines, 
hybridization process, restoration of male fertility in hybrid progeny and at the same time has 
high biological safety parameters, e.g. provides for tight control over transgene segregation. A 
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general scheme of hybrid seeds production using currently existing genetlc/transgenic 
approaches Is shown In Figure 12. 

In the present Invention, we describe a new process of producing hybrid seeds (Fig. 1 3) 
which has all necessary characteristics to match the requirements of an Ideal hybridization 
system. A comparison of the hybrid seed production system of the invention with prior art 
methods is presented In Table 1 . 

It is therefore an object of the invention to provide a process of producing a transgenic 
plant expressing a trait of interest, notably male sterility, whereby distribution of said trait to 
progeny Is strictly controlled and occurs with low probability. 

It is a further object of the invention to provide a process of producing a biologically safe 
transgenic plant, notably a male sterile plant, that expresses a trait of interest, whereby gene 
fragments encoding said trait are positioned such that undesired transmission of said trait 
occurs with low probability. 

It Is a further object of the Invention to provide a process of positioning transgenic DNA 
sequences on homologous chromosomes, notably in the same locus of homologous 
chromosomes of a multi-cellular organism. 

It Is also an object of the invention to provide a process of producing a male sterile plant 

line. 

It Is another object of the invention to provide a universal and environmentally safe 
process of producing hybrid seeds using a sterile plant line, whereby complete fertility 
restoration occurs in said hybrid seeds. 
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GENERAL DESCRIPTION OF THE INVENTION 

The Invention provides a process of producing a transgenic multi-cellular plant organism 
or parts thereof expressing a trait of interest and having a controlled distribution of said trait to 
progeny, wherein said process comprises 

(i) producing a first plant or a ceil thereof having in a first locus of a nuclear chromosome 
a first heterologous nucleotide sequence comprising a first fragment of a nucleotide 
sequence encoding said trait of interest, 

(ii) produdng a second plant or a cell thereof having in a second locus of a nuclear 
chromosome homologous to said nuclear chromosome of step (i), a second 
heterologous nucleotide sequence comprising a second fragment of the nucleotide 
sequence encoding said trait of interest, and 

(ill) hybridising said first and said second plant or cells thereof to generate progeny 
exhibiting said functional trait of interest due to binding between a protein or polypeptide 
encoded by said first heterologous nucleotide sequence and a protein or polypeptide 
encoded by said second heterologous nucleotide sequence. Said binding preferably 
involve protein trans-splicing. 

Said multi-cellular plant organisms or said parts produced by the above process may express 

two traits of Interest, a trait (1) and a trait (2), both traits having a controlled distribution to 

progeny. 

The Inventors of this invention have developed for the first time a method of rendering 
transgenic plants environmentally safe in that the transgene or a trait of interest expressed by 
said plant has a controlled distribution to progeny of said plant The invention solves a major 
problem of biotechnology, notably of plant biotechnology, since transfer of a transgene from a 
GM plant to other organisms can now be effectively controlled and limited. Transfer of a 
transgene to other organisms includes transfer to sexual progeny by cross-pollination as well 
as lateral gene transfer. The above processes make obtainable genetically modified multi- 
cellular plants with a controlled containment of a trait of interest 

In an important embodiment said trait of Interest is male or female sterility, preferably 
male sterility. In this case, the transgenic multi-cellular plant organism of the invention may be 
used for hybrid seed production by crossing with another plant that is male fertile or female 
fertile, respectively. The hybrid seeds produced using the transgenic multi-cellular plant of the 
Invention may be 100 % fertile due to a controlled distribution of the sterility trait to progeny. In 
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a partlculariy preferred embodiment, said transgenic multi-cellular plant of the Invention may 
express two tralte of Interest, a male sterility trait and a herbicide resistance trait, what makes 
amenable a novel process of producing hybrid seeds with several advantages over prior art 
processes (see below). 

In the process of the Invention, the nucleotide sequence encoding (or involved In) said 
trait Is spot into two or more fragments. Preferably, said nucleotide sequence is split Into two 
fragments of said nucleotide sequence, thus obtaining a 5' and a 3' part of the nucleotide 
sequence. Said 5* part conresponds essentially to said first fragment. Said 3' part corresponds 
essentially to said second fragment Said nucleotide sequence is typically a coding sequence 
(or an open reading frame) of a protein involved in said trait. However, said nucleotide 
sequence may contain one or more introns. To obtain said fragments, said nucleotide 
sequence is preferably spilt such that each obtained fragment, upon expression, is incapable 
of generating said trait In the absence of the other fragment Each fragment contains a 
sequence portion necessary for the function of the protein involved in said trait For example, 
if said protein involved In said trait is an enzyme, each fragment preferably contains amino 
acids necessary for catalysis or substrate binding of the enzyme. A protein involved or 
encoding a trait may be split into said fragments In many different ways provided that 
expression of said trait requires all said fragments and binding thereof to each other. Structural 
and functional Infonmatlon loiown about the protein involved In said trait may be helpful for 
finding a suitable splitting site of said nucleotide sequence. In any case, one can easily test 
experimentally whether a fragment generated by splitting a nucleotide sequence at a randomly 
chosen site Is capable of expressing a trait encoded by said nucleotide sequence. The 
following description focuses on frie prefen-ed embodiment wherein said nucleotide sequence 
encoding said trait is split into two fragments. 

Expression of said trait requires the presence of both said fragments in the same plant 
preferably In the same cells thereot Expression of said trait further requires transcription and 
translation of said firetand said second fragment and binding of the translation producte of said 
fragments to each otfier with or without peptide bond formation. Preferably, said binding 
Involves peptide bond formation between said firagments. 
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The firet fragment is Incorporated Into a first heterologous nucleotide sequence, the 
second fragment is incorporated into a second heterologous nucleotide sequence. Preferably, 
said heterologous nucleotide sequences are DNA sequences. 

PfBferabiy, said first and said second heterologous nucleotide sequence further codes 
for a first and a second binding polypeptide, respectively, that renders said polypeptides 
encoded by said first and said second heterologous nucleotide sequences capable of said 
binding. Each binding polypeptides is preferably expressed as a protein fusion with the 
polypeptide encoded by said first or said second fragment 

Said polypeptide or protein encoded by said first heterologous nucleotide sequence 
comprises, preferably consists of. a first binding polypeptide and a polypeptide encoded by said 
first fragment Said polypeptide or protein encoded by said second heterologous nucleotide 
sequence comprises, preferably consists of. a second binding polypeptide and a polypeptide 
encoded by said second fragment 

After transcription and translation, each of said polypeptides or proteins has at least the 

following two functions: 

(i) providing a part of the protein involved in said trait; 

(11) the capability of binding to tiie polypeptide or protein encoded by tfie other fragment 
Amino acid sequence portions responsible for said functions (i) and (ii) may or may not overlap. 

Said binding may or may not involve peptide bond fomiation between said proteins or 
polypeptides encoded by said first and second heterologous nucleotide sequences. Without 
peptide bond formation, said binding polypeptides may bind to each other by affinity. In tiiis 
case, said binding polypeptides may be polypeptides known to bind to each other e.g. from 
naturally occumng binding domains of protein complexes. Preferably, said binding polypeptides 
involved In said binding affinity or at least one of tiiem can be artificially engineered. Said 
binding polypeptides may e.g. be ttie components of an antigen-antibody pair. Further, said 
binding polypeptides may be selected artificially using e.g. random peptides phage display 
libraries (for review see: Barbas CF., 1993. Curr Opin. Biotechnol., 4, 526-530; In/ing et al.. 
2001. Curent Opin. Chem. Biol.. 5l31 4-324; Hoogenboom HR. 1997, Trends Biotechnol., 15: 
62-70) or yeast two-hybrid system (for review see Fields & Sternglanc. 1994. Trends Genet, 
liL 286-292; Bartel & Fields.. 1995. Methods Enzymol., 254: 241-263). Furttier. they maybe 
Intein fragments tiiat may have been rendered non-fijnctional for intein splicing. 
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In an important embodiment, said binding comprises peptide bond fonnation between 
said protein or polypeptides encoded by said first and second heterologous nucleotide 
sequences. Peptide bond fomnation between the polypeptides encoded by said fragments is 
preferred. Said binding is or comprises preferentially intein-mediated trans-splicing. For this 
purpose, said first and said second heterologous nucleotide sequences further code for 
proteins or polypeptides capable of protein trans-splicing. By said trans-splicing, tiie proteins 
and polypeptides encoded by said first and said second fragments may be linked by peptide 
bond formation. In tiiis embodiment, said binding polypeptides are preferably derived from an 
Intein capable of trans-splicing. Trans-splicing inteins may be selected from the nucleolar and 
organellar genomes of different organisms including eukaryotes, archaebacteria and 
eubacteria. Inteins that may be used for performing this invention are listed at 
http://www.neb.com/neb/inteins.htiTil. Also, an intein mentioned in a reference cited herein may 
be used. The choice of tiie intein might depend on the consensus sequences as well as the 
conditions required for efficient trans-splicing. 

For engineering said heterologous nucleotide sequences, tiie nucleotide sequence 
coding for an Intein may be split into a 5' and a 3' part tiiat code for the 5' and ttie 3* intein (as 
denoted herein), respectively. Sequence portions not necessary for intein splicing (e.g. a 
homing endonuclease domain) may be deleted. The intein coding sequence is split such that 
the 5' and the 3' Inteins are capable of trans-splicing. Regarding a suitable splitting site of the 
Intein coding sequence, ttie considerations published by SoutiiworUi et al. (EMBO J. (1998) 17, 
918-926) may be followed. The capability of ttie 5' and the 3' inteins for trans-splicing may of 
course be tested experimentally, e.g. as described by Soutiiworth et al. (ibid). Experimental 
testing may be done by trans-splicing. Experimental testing of intein portions tiiat can be 
deleted wittiout compromising tBns-«pliclng functionality may be done by trans-splicing or by 
cis-splicing. 

The 5" intein corresponds essentially to tiie first binding polypeptide. The 3' intein 
corresponds essentially to the second binding polypeptide. For engineering said heterologous 
nucleotide sequences, ttie 5' intein coding sequence Is linked to ttie 3' end of said first 
fragment The 3' intein coding sequence is linked to ttie 5' end of said second fragment Notably 
in ttie vicinity of ttie linking site, nucleotides and/or codons (amino acids) may be changed to 
achieve a desired trans-splicing functionality. 
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Said first heterologous nucleotide sequence thus may comprise: said first fragment, said 
first binding polypeptide, regulatory sequences for transcription (e.g. promoter. 3' transcription 
termination sequence) and for translation. Said second heterologous nucleotide sequence may 
comprise: said second fragment, said second binding polypeptide, regulatory sequences for 
transcription (e.g. promoter, 3' transcription temnination sequence) and for translation. Further, 
It may contain a selectable and/or a counter-selectable marker needed for producing said first 
and/or said second plant and sequences recognised by a site-specific recombinase or 
transposon sequences (cf. below). 

The process of the Invention may also be used to assemble two or more ti^its, notably 
by trans-splicing. However, different intein systems should be used for the assembly of each 
trait in order to avoid trait mis-splicing due to tiie universal nature of interaction between intein 
parts, which is independent of attached protein fragment destined for trans-splicing. 

In the process of the invention, said first plant or cells thereof may be produced by 
introducing said first heterologous nucleotide sequence into a precursor plant or cells tiiereof. 
Said second plant or cells tiiereof may be produced by introducing said second heterologous 
nucleotide sequence into a precursor plant or cells tiiereof. Said introducing may be done 
according to metiiods generally known in tiie art Preferably, botti heterologous nucleotide 
sequences are stably incorporated into a chromosome of the nuclear genome of tine first and 
ttie second plant Said first and said second plants obtained thereby are preferably made 
homozygous witii respect to tiie respective heterologous nucleotide sequences according to 
procedures known In tiie art, notably by selfing. Said first and said second plants belong 
preferably to tiie same family, more preferably to ttie same genus, and most preferably to tiie 
same species of organisms. 

The invention provides multi-cellular plants (and parts tiiereof like seeds) expressing a 
trait of interest and having a controlled distiibution of said trait to progeny, whereby a protein 
involved in said ti-ait is generated by binding, notably by trans-splicing, polypeptides encoded 
by said heterologous sequences. Said polypeptides are encoded on homologous 
chromosomes of said organism in a first and a second heterologous nucleotide sequence. 

In principle, several relative locations of said first and said second heterologous 
nucleotide sequences and tiie respective flragments exist in tiie transgenic plant of tiie 
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Invention. Said first and said second heterologous sequences in said transgenic plant of the 
Invention should be positioned such that they segregate as unliniced loci. Said unlinlced loci are 
preferably positioned so as to minimize melotic recombination or crossing-over and creation of 
llnltage between said loci. 

Possible relative locations of said first and said second heterologous nucleotide 
sequences and said Augments contained therein are generally shown in Fig. 2B using a diploid 
organism as an example. 

In case I of Fig. 2B, said first and said second fragments are located on the same 
chromosome. I.e. they are physically linked on the same DNA molecule but are separated fl-om 
each other by chromosome sequences native to the organism. The Augments will belong to 
different transcriptional units. Since crossing-over in melosis may lead to separation of the 
fragments (or the heterologous sequences containing the fragments), the probability of 
transferring the trait encoded by both fragments to progeny is reduced compared to the 
conventional case, where the trait is encoded by a continuous coding sequence. 

In case II (see Fig. 2B), said first and said second fragment are located on different 
heterologous chromosomes. The frequency of inheriting said trait encoded by the two 
fragments on different chromosomes upon self-crossing Is about 50% and upon crossing with 
an organism not canying any of these flragments 25%. In prior art cases I and II, the probability 
of transfening both fragments to progeny or to other organisms is too high for practical 
purposes, notably If the trait encoded In said fragments provides an advantage for survival or 
propagation. These cases do not represent biologically safe cases of a transgenic plant 

The Inventors of this Invention have found that the fi-equency of transfening said trait to 
progeny (upon crossing with plants not having said trait) and to other organisms can be 
enomiously reduced when said fragments are located on homologous chromosomes as 
schematically shown in Fig. 28, case 111 and IV. 

In case III (Fig. 28), the two fragments are present at different loci on homologous 
chromosomes, i.e. are linl<ed in repulsion. The closer the fragments are located, the lower the 
fiBquenqr of recombination between said loci and, consequently, transfening the trait to 
progeny as the result of cross-hybridisation. In the most preferred case (case IV in Fig. 28), the 
fragments are located in the same locus on homologous chromosomes. Thus, the trait reliably 
segregates In cross-progeny (hybrid progeny) of the multi-cellular plant of the invention. 
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Such relative locations of said first and said second heterologous nucleotide sequences 
on homologous chromosomes of the plant of the invention are achieved by hybridising, notably 
crossing, said first and said second plant or cells thereof. Said first and said second plant may 
be obtained by methods known In the art. Further possibilities are disclosed in the following. 

In one embodiment, many transfomiants are produced with said first as well as with said 
second heterologous nucleotide sequence. Then, the chromosome having said heterologous 
sequence incorporated as well as tfie location of the transfomned sequence in the chromosome 
may be detennined by genetic or molecular biological methods. Next, a transfomned plant or 
cell clone thereof having said firat heterologous nucleotide sequence at a suitable location may 
be selected. Then, a transfomied plant or cell clone tiiereof having said second heterologous 
nucleotide sequence at a suitable location relative to said first sequence may be selected. 
Thereby, a suitable pal*" of first and second plants may be chosen. 

In a second embodiment, targeted integration Into a desired locus of a desired 
chromosome is employed making use of homologous recombination. Preferably, targeted 
Integration is done using a multi-cellular plant having a targeting site pre-integrated into a 
chromosome In combination witii site-specific recombination. The latter approach is particularly 
useful for introducing said first and said second heterologous nucleotide sequence into the 
same locus of the same chromosome, as tiie same starting organism line having a pre- 
integrated targeting site may be used for tiTansfomiing said first and said second heterologous 
nucleotide sequences. Targeted integration is described e.g. in international patent application 
PCT/EP02/03266 (WO02/077246). IVIethods of creating sites for targeted integration in plants 
wfth diflerent expression profiles are described is described in PCT/US02/11924. Mettiods of 
improving the efficiency of site-targeted Integration Is described e. g. in international patent 
application PCT/EP02/03266. 

Alternatively, said first heterologous nucleotide sequence can be incorporated into a 
chromosome of the nuclear genome of the first organism and said second heterologous 
nucleotide sequence can be incorporated into tiie plastid or mitochondrial genome of the same 
or another organism. However, incorporation of both heterologous nucleotide sequences into 
nuclear chromosomes Is preferred. 
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Preferred methods of producing said first and said second plant are schematically 
depicted on the right hand side („Excision") of Fig. 2E and in Figs. 5 to 8. In these prefen-ed 
methods, steps (i) and (ii) of daim 1 are canied out by 

(a) Introducing a parent heterologous nucleotide sequence comprising said first and said 
second heterologous nucleotide sequences into a nuclear chromosome of parent 
organisms or cells thereof, 

(b) optionally selecting organisms or cells thereof having said parent heterologous 
nucleotide sequence integrated in a desired chromosome or chromosome locus, 

(c) subsequently splitting said parent heterologous nucleotide sequence so that said first 
and said second heterologous nucleotide sequences are located on homologous 
chromosomes In different plant organisms or cells. 

Said parent heterologous nucleotide sequence comprises said first and said second 
heterologous nucleotide sequence. Preferably. It further comprises sequences for excising said 
first and/or said second heterologous sequence (for details see below). Said introducing (a) 
may be done by any known transfonnatlon method (see below). Agrobacterium-mediated 
transfomiation prefen-ed. Plants or cells carrying said parent heterologous nucleotide sequence 
may be selected using a selectable mariner contained therein. Whole plants may be 
regenerated from transfbmied cells or tissue. Preferably, plants homozygous for said parent 
sequence are created. 

A plant (or a group of plants) canying said parent sequence may then be used for 
excising said first heterologous nucleotide sequence out of said parent sequene. Thus, said 
second plant may be obtained. Another plant (or group of plants) may be used for excising said 
second heterologous nucleotide sequence for obtaining said first plant The heterologous 
sequences which are not excised are located in said first and said second plant in homologous 
chromosomes, notably in the same locus of said homologous chromosomes, i.e. in iso-loci. 

The first and second plants or cells thereof thus obtained (or progeny thereof) are 
advantageously analysed for any reintegration of an excised heterologous nucleotide sequence 
into the genome e.g. by genetic or molecular biological techniques (e.g by PCR and use of 
nucleotide probes for Southern hybridisation). Plants or cells thereof may then be selected that 
contain said heterologous nucleotide sequence reintegrated at a desired locus on a 
chromosome homologous to the chromosome hartsoring the heterologous nucleotide sequence 
that has not been excised. Thus, the transgenic plant of the Invention may directly be obtained. 
Preferably, plants or cells thereof that are free of the excised heterologous nucleotide sequence 
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are selected. Said selection may comprise analysis by genetic or molecular biological 
techniques. Preferably, said selection is supported by a counter-selectable marker in the 
heterolgous sequence to be excised. Said first and said second plant are preferably made 
homozygous for said heterologous sequence that has not been excised. 

Said excising may e.g. be done using site-specific recombinases (cf. Fig. 2E). It is highly 
convenient that said excising Is done using transposons, notably non-autonomous transposons 
(l.e. a transposon not encoding the respective transposase). For the latter embodiment, said 
first and/or said second heterologous nucleotide sequence in said parent heterologous 
nucleotide sequence Is/are embedded in such a transposon. Said excision comprises providing 
a transposase for said transposon. Notably, 

(A) said firet heterologous nucleotide sequence In said parent heterologous nucleotide 
sequence Is contained in a first transposon and said second heterologous nucleotide 
sequence Is contained In a second transposon and 

(B) said firet heterologous nucleotide sequence is excised by providing a first transposase 
functional with said firet transposon and said second heterologous nucleotide sequence 
is excised by providing a second tranposase functional with said second transposon. 

Said first and said second transposons in said parent heterologous nucleotide sequence 
preferably overiap such that excision of said firet or said second heterologous nucleotide 
sequence leads to dlsmptlon of said second or said firet non-autonomous transposon, 
respectively. Overiapping transposons may conveniently be used witii a selectable and a 
counter-selectable mariner in the overiapping region as depicted in Rg. 7 and 8. 

Further, said splitting of step (c) does not necessarily require different recombinases for 
said excising said firet or said second heterologous nucleotide sequence. In a very convenient 
embodiment, said firet heterologous nucleotide sequence in said parent heterologous 
nucleotide sequence is fianked by difl^ering recombination sites of a site-specific integrase and 
said second heterologous nucleotide sequence in said parent heterologous nucleotide 
sequence Is flanked by diffening recombination sites of ttie same site-specific integrase (cf. Fig. 
21 and 22), and step (c) is carried out by 

providing said site-specific integrase to said parent organism or cells tiiereof. 

selecting progeny of said parent organism or cells thereof containing said firet 

heterologous nucleotide sequence but not said second heterologous nucleotide 

sequence, and 
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selecting progeny of said parent organism or cells ttiereof containing said second 
heterologous nucleotide sequence but not said first heterologous nucleotide sequence. 

In step (Hi) the process of the invention, said first and said second plants or cells thereof 
are then hybridised for obtaining the transgenic multi-cellluiar plant of the Invention. Hybridising 
may be sexual crossing or fusion of cells of said plants. Cell fusion may be fusion of gemi cells 
or of somatic cells. Preferably, hybridising involves pollination of plants or somatic cell fusion of 
protoplasts. Sexual crossing of plants is most prefenred. Said hybridising brings said fragments 
encoding or being involved in said trait together in one plant or cells thereof such that said plant 
exhibits said trait of Interest due to protein trans-splicing. Exhibiting said trait due to protein 
binding or protein trans-splicing means that binding or trans-splicing is a necessary condition 
for the expression of said trait of interest The production of the transgenic organism of tiie 
Invention may comprise further steps in addition to said hybridising. In the case of plants, 
examples of such further steps include: growing and harvesting seeds, seeding, and growing 
the plant of the Invention. In tiie case of protoplast fusion, such furUier steps include: 
propagating ttie fused protoplasts to obtain colonies, regeneration of plants. 

Controlled distribution of said trait to progeny means that tiie probability of transferring 
said trait to progeny Is signrflcantiy reduced compared to conventional transgenic organisms 
that have a transgene Involved in said ti^it of interest encoded in one locus of a chromosome, 
notably as a single tianscriptional unit, or on heterologous chromosomes. The frequency of 
appearance of said trait in progeny upon crossing said transgenic multi-cellular plant of tiie 
Invention with a plant devoid of said firatand said second heterologous sequences is less tiian 
10%, preferably less than 1 %, more preferably less ttian 0.1 %. even more preferably less ttian 
0.01%. an most preferably less tiian 0.001%. For comparison, tiie frequency of appearance of 
a transgene In progeny upon crossing a conventional transgenic (diploid) organism having said 
transgene In a single transcriptional unit and being heterozygous witti respect to the ti^nsgene 
with another organism of ttie same species not having said transgene is about 50%. Whettier 
a transgenic plant expressing a trait of interest fulfills tiie criteria of the Invention regarding said 
frequency can be easily checked experimentally. 



Herein, peptide bond means ttie amide linkage between the carboxyl group of one 
polypeptide and ttie amino group of anottier polypeptide. The linkage does not allow free 
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rotation and can occur in cis or trans configuration, tiie latter the most common in natural 
peptides, except for links to the amino group of proline, which are always cis (source: 
http://www.mblab.gla.ac.uk/dicHonary/). Peptide bond fonnation can be achieved through intein- 
mediated trans-splicing. 

In the process of the invention, transgenic multi-cellular plant organisms are produced. 
Among plants, crop plants including bariey. oat. rye, wheat, zea mays, rice, millet, potato, 
oilseed rape, canola, tomato, cotton, sorghum, and tobacco are prefen^ed. The processes of the 
Invention may be applied to diploid and to polyploid plants. 

Examples for traits expressible according to the invention, notably in plants, are male 
sterility, herbicide resistance, insecticide resistance, selectable marker, a counter-selectable 
marker, organism morphology, seed content, seed stability, climate adaption, vitamlne content, 
carbohydrate content and composition, fat content and composition etc. Further, said trait may 
be expression of a protein of interest, notably a pharmaceutical protein. Examples for such 
proteins are given below. In one case (cf. example 1), reporter gene is expressed In a plant of 
the invention. In another example of this invention (example 2) EPSPS (5-enolpyruvylshiklmatB- 
3-phosphate synthase) gene confemng herbicide resistance, e.g. glyphosate tolerance, Is 
expressed. Said multi-cellular plants and said transgenic multi-cellular plants of the Invention 
may be further genetically or transiently modified e.g. for providing functions necessary for said 
trans-splicing and/or said expressing of the trait of interest. Further, a second transgene 
involved in expression of said trait of interest or of a different trait may be expressed. 

The process of the invention may be used for a wide variety of applications. It may e.g. 
be used for expressing a trait of interest In said transgenic organism. Said trait may be any 
property of said organism, whether encoded by a single or by several genes. Said trait may be 
caused by expression of at least one protein. Two or more proteins may be necessary for said 
trait. In this case, it may be sufficient to control the expression of only one protein as described 
herein. It Is, however, environmentally safer to control all the proteins producing a trait by the 
processes of the invention. 

A highly Important application of said process is the production of hybrid seeds for 
generating plants for agricultural purposes or for protein production in said plants, whereby said 
plants have a controlled distribution of a trait to progeny. Said hybrid seeds allow the generation 
of plants expressing a trait of interest that is neither expressed in a parental line and quickly 
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segregates in progeny. 

Pmriiidna plants or cells themnf expressing f wn traits of interest with controlled dist|it?utiQn of 
said traits to prooenv 

The transgenic multi-cellular plants or parts thereof produced according to the invention 
may be made to express two (or more) traits of interest, whereby both traits may have a 
controlled distribution to progeny as defined above. For the preferred case of two such traits of 
interest, these are referred to in following as trait (1) and trait (2). The above description 
regarding said trait of Interest may apply to said trait (1 ) or to said trait (2). Preferably. It applies 
to said trait (1 ) and to said trait (2). However, expression of trait (1 ) or trait (2) may depend on 
RNA trans-splicing of mRNA expression products of said first and said second heterologous 
nucleotide sequence. Translation of the trans-spliced RNA may in this case generate one of 
said traits (1) or (2). RNA trans-splicing is described In detail in WO02/96192 and in references 
cited therein. It is also possible to expressed two or more traits via RNA trans-splicing. 

Preferably, the progeny generated in step (iii) of the process of the invention (i.e. the 
transgenic multl-cellular plants or parts thereof according to the invention) exhibits trait (1 ) and 
trait (2) due to binding between a protein or polypeptide encoded by said first heterologous 
nucleotide sequence and a protein or polypeptide encoded by said second heterologous 
nucleotide sequence. Further, said progeny may exhibit trait (1) or trait (2) due to intein- 
mediated trans^plicing. Further, said progeny may exhibit trait (1) and trait (2) due to intein- 

mediated trans-splicing. 

In the process of producing a multi-cellular plant or parts thereof expressing two traits of 
Interest steps (i) and (ii) may be earned out similarly as described above in detail for one trait. 
The plant produced in step (1) (plant A1 in Fig. 1 3) may contain a (first) fragment of a nucleotide 
sequence encoding trait (1) and a (first) fragment of a nucleotide sequence encoding trait (2). 
The plant produced In step (ii) (plant A2 in Fig. 13) may contain another (a second) fragment of 
a nucleotide sequence encoding trait (1) and another (a second) fragment of a nucleotide 
sequence encoding trait (2). Said firet fragments (of trait (1) and of trait (2)) in the plant 
produced In step (i) may be on the same or on different chromosomes. Similarly, said second 
fragments (of trait (1 ) and of trait (2)) in the plant produced in step (ii) may be on the same or on 
different chromosomes. It is prefened that said first fragments are on the same chromosome 
and that said second fragments are on the same chromosomes. More preferably, said first 
fragments are in the same locus of a chromosome and said second fragments are in the same 
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locus of a Chromosome. Most preferably, the locus having said first fragments of said first plant 
and said locus having said second fragments of said second plant are the same loci on 
homologous chromosomes, I.e. are Iso-locl. 

In a prefened embodiment, said first fragment of a nucleotide sequence encoding trait 
(1 ) and said first firagment of a nucleotide sequence encoding trait (2) are contained In said first 
heterologous nucleotide sequence of the Invention. Similarly, said second fragment of a 
nucleotide sequence encoding trait (1) and said second fragment of a nucleotide sequence 
encoding trait (2) are contained In said second heterologous nucleotide sequence of the 
Invention. Step (iii) may then comprise hybridising said first and said second plant or cells 
thereof to generate progeny exhibiting trait (1 ) and trait (2), whereby exhibiting of trait (1 ) is due 
to binding between a protein or polypeptide encoded by said first heterologous nucleotide 
sequence and a protein or polypeptide encoded by said second heterologous nucleotide 
sequence. 

In the aforementioned preferred embodiment, a strlctiy controlled distribution of trait (1) 
and of trait (2) in tiie plant produced by the process of the invention can convenientiy be 
adhleved, If said first and said second heterologous nucleotide sequence are located in iso-loci 
in said first and ssud second plant Therefore, progeny obtained by crossing said transgenic 
multi-cellular plant of tiie Invention tiiat expresses said two traits of interest with anotiier plant 
not containing said fragmente will express neltiier trait (1 ) nor trait (2). 

Steps (I) and (ii) are preferably canied out by 

(a) introducing a parent heterologous nucleotide sequence comprising said first and said 
second heterologous nucleotide sequences into a nuclear chromosome of parent 
organisms or cells thereof, 

(b) optionally selecting organisms or cells thereof having said parent heterologous 
nucleotide sequence Integrated In a desired chromosome or chromosome locus, 

(c) subsequentiy splitting said parent heterologous nucleotide sequence so that said first 
and said second heterologous nucleotide sequences are located on homologous 
chromosomes in difterent plant organisms or cells, 

whereby said first heterologous nucleotide sequence of said parent nucleotide sequence 
contains the first fragment of trait (1) and ttie first fragment of trait (2), and said second 
heterologous nucleotide sequence of said parent nucleotide sequence contains tiie second 
fragment of trait (1 ) and tine second firagment of trait (2). Said splitting of step (c) may be canied 
out as described above, whereby plant A1 and plant A2 may be obtained. Preferably, said 
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plants produced In step (i) and in step (ii) are selfed for rendering them homozygous for said 
first and/or said second heterologous nucleotide sequence. 

Examples for trait (1) and for trait (2) may be those given above. 

Process of hybrid seed producMon 

In an embodiment of utmost importance, trait (1 ) is hertjicide resistance and trait (2) is 
male or female sterility, whereby male sterility is prefened. In this embodiment, the process of 
the invention may be used for hybrid seed production for agricultural purposes. Thus, the 
Invention provides a process of producing hybrid seeds, comprising producing a transgenic 
multi-cellular plant according to the invention (refen-ed to herein as plant A1/A2 in Fig. 13). 
Preferably, trait (1) is a herbicide resistance and trait (2) Is male sterility. Said process of 
producing hybrid seeds typically further comprises crossing said transgenic multi-cellular plant 
organism with another plant that Is male fertile (referred to herein as plant B in Fig. 1 3). Plant B 
should not contain a fragment of a nucleotide sequence encoding said herbicide resistance or 
said male sterility. The hybrid seeds growing on the male-sterile hertjldde resistant plant A1/A2 
may then be harvested. The Invention also provides the hybrid seed obtained thereby. 

The use of said herbicide resistance trait said the process of producing hybrid seeds 
has the following advantages (cf. Fig. 13): said resistance may be used for selecting plants 
containing said parent heterologous nucleotide sequence (line A in Fig. 13). Further, said 
herbicide resistance may be used for selecting male sterile cross-progeny in step (iii) of the 
Invention (cross-progeny of line A1 and line A2 in Fig. 1 3). as non-sterile self progeny of line A1 
and non-sterile self-progeny of line A2 is not hertiicide resistant. Consequently, purely male 
sterile stands of plants may be obtained, and, upon crossing with line B, progeny seeds 
growing on the male sterile line A1/A2 will be 100% hybrid. Self-progeny seeds growing on 
plants of line B may be separated by harvesting seeds of line A1/A2 separately from seeds 
growing on line B. In contrast to prior art processes of producing hybrid seeds using male 
sterile plant lines, the process of producing hybrid plants disclosed herein is of much more 
efficiency and less laborious to perform, as the plant lines A1 and A2 may easily maintained by 
selfing. 

Une A containing the pro-locus sequence (Fig. 13) may be male sterile. This is 
advantageous for generating primary transfonnants of line A with a desired phenotype (e.g. 
male sterility, herbicide resistance etc), but maintenance of line A may then be difficult. Une A 
may therefore be designed such that it is fertile, but lines A1 and A2 may still provide male 
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Sterile plant A1/A2 upon crossing. This may be achieved by separating, In said parent 
heterologous nupleotlde sequence of line A (pro-locus), one of the fragments of the nucleotide 
sequence encoding the male sterility trait from its promoter. Then, said pro-locus would not 
provide for male sterility, as one of the fragments encoding male sterility is not expressed. 
Creation of iso-loci (lines A1 and A2) may bring together promoter and fragment such that said 
fragment can be expressed, thus allowing to obtain male sterile A1/A2 plants. As an example, 
said first heterologous nucleotide sequence may intemjpt said second heterologous nucleotide 
sequence in the pro-locus. Upon creation of lines A1 and A2, excision of said first heterologous 
nucleotide sequence may restore the functionality of said second heterologous nucleotide 
sequence. 

Due to the controlled distribution of both traits to progeny, the cross-progeny (F1 
progeny in Fig. 13) will show hybrid vigor and have restored fertility and restored sensitivity to 
the herbicide the plant A1/A2 was resistant against Preferably, sterility and hertiicide sensitivity 
are restored in at least 96% of the progeny, more preferably in at least 99% of the progeny. 
Consequently, said F1 progeny may be used for large scale planting on farm fields without any 
danger of outcrossing or transferring a functional herbicide resistance gene in the environment 

In example 4 of the Invention, engineering of split AHAS gene providing for resistance 
to imidazoline and sulfonylurea herbicides is described. The AHAS gene was PGR amplified 
from Arabidopsis genomic DNA. mutated and cloned in vectors (Fig. 16) for testing its 
functionality in transient assays. In example 5, engineering of split bamase providing for a 
cytotoxic RNase is described. In both examples, we use the intein system to provide for trans- 
splicing of proteins encoded by spilt gene fragments. Trans-splicing is mediated by two different 
intein systems which do not cross-react with each other. This system is based on 
Synechocystis sp. PCC6803 DnaE intein for AHAS and the DnaB intein for bamase. The 
intemiediate constructs with split AHAS-intein fusions and split bamase-intein fusions are 
shown in Figures 1 7 and 1 8. respectively. 

Transient test experiments showed the intein-mediated assembly of functional proteins 
encoded by gene fragments. The invention is not limited to the use of the AHAS gene providing 
for herbicide resistance. Many other genes confening herbicide resistance can be used, subject 
to con^ct splitting and reconstruction by intein-mediated trans-splicing. Examples of such 
genes include inter alia 5-enolpyruvylshlkimate-3-phosphate synthase, phosphinothricin acetyl 
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transferase (BAR), betaine aldehyde dehydrogenase (BADH), dlhydrofolate reductase (DFR1). 
acetolaotate synthase (ALS), glyphosate oxidoreductase. 

Further, bamase is one of several possible genes that may provide for male sterility. 
Many other genes that affect pollen development when expressed in anther cells or at a desired 
stage of pollen fonmation may be employed. Actually, any gene, the gene product of which is 
capable of Interfering with the function and development of pollen can be used In this invention. 
Examples of such genes inter alia ribosomal inhibitor proteins (Cho et ai, 2001 , Mol. Cells, H, 
326-333), sucrose isomerase (W0159135), protease, glucanase (Tsuchia etal., 1995, Plant 
Cell Physiol.,36. 487-494), etc. Alternatively, genes responsible for self-incompatibility 
(preventing self pollination of plants containing said genes) may be used to provide for hybrid 
seeds production, notably Instead of the male sterility trait discussed above ( Entani, T.. et ah, 
2003. Genes Cells, g. 203-213; Ushijima, K., et al., 2003. Plant Cell. 15. 771-781 ). 

Various pollen or tapetum-specific promoters can be used to drive the expression of a 
gene/gene fragments for producing male sterility. Examples of tapetum specific promoters are 
promotere of the A3 and A9 genes (US5723754; Hodge ef al., 1991. J Exp. Botany. 42. 238 
Suppl. p. 46), the tapetum-specific promoter from rice Osg6B gene (Tsuchia et al., 1994, Plant 
Mol. Biol., 26, 1737-1746). the promoter of tobacco gene TA29 (Kriete et al.. 1996, Plant J., 
9:809-818), etc. Tissue-specffic expression of a gene providing for male-sterillty is described in 
detail in W098/32325. 

In the next step of cloning, said gene fragments were assembled in pairs in intemiadiate 
constructs (Rg. 19) designed for final pro-locus vector engineering (Fig. 21) according to the 
descripton in example 6. Said pro-locus vector is designed for generation of parental line A, as 
described in example 8. Said parental line that will be male-sterile can be selected by using the 
herbicide resistance provided by split AHAS gene. For generating lines Al and A2 flnom the 
parental plant, site-specific recombination may be used. A description of vectors providing for 
recombinase activity is presented in example 7. The transgenic plants carrying recombinase 
genes may be generated in the same way as the parental plants carrying pro-locus. Methods 
of transformation are exemplified in example 8. 

In order to generate lines Al and A2 carrying iso-loci, primary transfomnants 
Gon-esponding to the parental line were cross-pollinated witii pollen flrom the plant providing for 
recombinase activity (example 8). The progeny from such crosses was tested by PGR for the 
presence of heterologous DNAconiesponding to one and tiie absence of tiie heterologous DNA 
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corresponding to the another iso-locus and vice-versa. The generation and structure of iso-Ioci 
is shown in Figure 22. Generated lines A1 and A2 carrying different iso-Ioci were tested for their 
functionality by cross-pollination. If homozygous lines were used, all progeny from such lines 
was herbicide resistant and male sterile. In Figure 22, we demonstrate the possibility of 
generating iso-Ioci from a pro-locus with the help of one site-specific recombinase. For 
recombinase PhiC31, recombination (excision or integration) requires two different 
recombination sites, AttP and AttB. Recombination catalysed by this integrase is an in-eversible 
process, as it leads to the fonmation of AttL or AttR sites that are not recognised by 
recombinase PhiC31 . The pro-locus shown in Figure 22 contains three such sites and random 
interaction between two of them (catalysed by the integrase) would lead to excision event with 
two possible outcomes, generating either line A1 or line A2 with iso-Ioci. in contrast, a similar 
approach with parental line transfomied with vector plCH12970 (Rg. 21) will produce four 
different variants of Iso-Ioci with and without HPT selectable marker due to the presence of an 
additional AttB site. 

The approach with said pro-locus in parental line A has important advantages over 
Icnown hybrid seed production systems: it allows to directly select primary transformants 
showing the required male sterile phenotype; fertility restoration during the generation of lines 
A1 and A2 with iso-Ioci from parental line may be tested. This reduces the time neccesary for 
developing the hybrid seed production system of the invention and makes its maintenance 
convenient and straightforward. 

In addition, the approach of the invention is easily compatible with other methods, for 
example with methods of controlling seed gennination. Controlling seed genmination may 
address specific biosafety issues, especially in the case of producing Industrial enzymes, 
proteins for human and animal health, etc., in hybrid plants. Controlling seed gennination can 
eliminate the problem caused by plant -"volunteers" which frequently contaminate the following 
harvest and may pose a serious biosafety problem, especially in case of "phanna" proteins. 
There are several reports addressing the issue of controlling seed germination (US5723765; 
W09744465; US5129180; US5977441), however, these methods are not integrated into a 
process of producing hybrid seeds. Controlling the germination of seeds harvested from hybrid 
plants may be done according to the general teaching of this invention. Preferably, the hybrid 
(F1) plant is homozygous for an Inactive locus A3 (see Rg. 14D) that can control seed 
germination after being activated {the activated locus A3 is designated A3* in Fig. 14D). This 
would provide all progeny of F1 plants witii locus A3. Said homozygocity in F1 may be 
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achieved by introducing a heterologous sequence controlling seed germination in a pre- 
determined position of a nuclear chromosome of line A1/A2 (or ist precursors line A1 , A2 or line 
A) and in line B e.g. via homologous recombination or site-directed integration. Alternatively, 
introgression of the desired locus by standard breeding methods is also possible. In addition, 
the hybrid plant (F1) should contain an activator (A4 + B4) for said Inactive locus (A3), said 
activator may be encoded by two heterologous nucleotide sequences, A4 and 84 (Fig. 14D). 
Sequences A3, A4, and B4 may be brought together as the result of crossing between line 
A1/A2 and line B to produced F1 plants. In F1 plants, the activator can be rendered functional 
by Intein- or ribozyme-mediated trans-splicing of protein or RNA sequences, respectively, 
expressed from sequences A4 and B4. Preferably, said activator is a recombinase or a 
transposase under conti-ol of a transientiy active promoter (US597741 ), whereby said promoter 
Is preferably not embryo-, seed- or seed gennination specific, i.e. it does not overiap witii or 
proceed the expression partem of the promoter driving the expression of gene(s) of the A3 
locus that controls seed germination. The promoter controlling A3 and said gene controlling 
seed gemiination (A3) may be separated by a blocking sequence which can be removed by 
said recombinase/transposase used as said activator. Alternatively, said promoter controlling 
A3 or said gene controlling seed germination can be re-oriented relative to each other by site- 
specific recombination, resulting in activation and expression of A3. The activated A3 (A3*) will 
be inherited to progeny of F1 plants. Self-progeny of F1 plants will be homozygous for A3*, 
cross progeny of F1 plants will be heterozygous for A3*. Consequentiy. progeny seeds of the 
F1 plants will not be viable, ie. stop growtii in an eariy stage of development 

The development of a plant can be divided into two major groups of stages following 
germination: vegetative stages (V) and reproductive stages (R). Vegetative stages begin with 
emergence stage (VE) followed by the cotyledon stage (VC) and by consecutive stages of 
vegetative development until the beginning of reproductive stages (beginning bloom). Thus, the 
invention also provides plants grown from the hybrid seeds of the invention, wherein progeny 
seeds of said (hybrid) plants do not reach ihe cotyledon stage, preferably they do not reach the 
VC stage, preferably they do not reach the VE stage, most preferably they do not germinate. 
Using this embodiment, hybrid plants witii a potentially problematic genetic content may be 
used e.g. for expressing a protein of interest, witinout the danger that seeds from these plants 
give rise to unwanted plants in the next growing season. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 

General scheme of Inteln mediated trans-splicing resulting in functional protein fomiation. 



Rgure 2 

A - depicts the general principle of the invention, where frans-spiicing-mediated fomiation of a 
functional protein takes place in cells of hybrid progeny; 

B - depicts four possible relative locations of the first and the second heterologous nucleotide 
sequences on host chromosomes of an organism. Case 111 and IV show relative locations of 
said heterologous sequences in the transgenic multi-cellular plant of the invention. A diploid 
organism having two chromosomes and a trait of interest encoded by two fragments (A and B) 
is used as an example. 

C - depicts the basic principle of achieving allelic locations of said first and said second 
heterologous nucleotide sequences providing for trans-splicing by means of site-targeted 
integration. 

D - depicts the basic principle of achieving allelic locations of said first and said second 
heterologous DNA sequences providing for trans-splicing by means of transposition. 
E - general scheme of methods for achieving allelic locations of different heterologous DNA 
sequences on homologous chromosomes. 

Figure 3 shows schematic representations of T-DNA regions for plasmids plCS'gfplnt and 
plCintgfpS'. 

Rgure 4 depicts schematic representations of T-DNA regions for plasmids plCS'epspnnt and 
plCint-epspS". 

Figure 5 depicts schematic representations of T-DNA regions for plasmids plC5'epsp-intM and 
plClnt-epspS'M. 

Rgure 6 depicts a schematic representation of a constmct designed for achieving allelic 
location for the 5' or 3' parts of EPSP coding sequence (A) and its derivatives (B and C) 



wo 03/102197 



PCT/EP03/02986 



24 

resulting from excision of non-autonomous transposable elements (Ds or dSpm. respectively) 
upon exposure to transposase source. 

Figure 7 depicts a general scheme of a construct (center) designed for achieving allelic 
locations of different heterologous DNA fragments (hDNA 1 and hDNA 2) by way of 
transposition-mediated removal of unwanted fragments upon exposure to a transposase 
source. SM - selectable transformation marker; CSM - counter-selectable marker. 
On the top. the unwanted fragments excised by the action of the respective transposase are 
shown. At the bottom, the desired fragment left behind by the transposition are shown. 

Rgure 8 shows a schematic representation of a method of generating plants with different 
heterologous DNA fragments (hDNA 1 and hDNA 2) in allelic locations using transposition. A 
transposase is provided to progeny of plant 1 by crossing plant 1 with plant 2. SM - selectable 
marker gene; CSM - counter-selectable marker gene; TPase - transposase. 

Rgure 9 depicts intermediate constructs and Binary vectors used to make constructs shown In 
figure 3 and 4. 

Rgure 10 depicts a map of plasmid plCH5300. 

Rgure 1 1 depicts a map of Icon Genetics Binary vector pICBV16. 

Rgure 12 depicts the general schemes for existing geneticAransgenic hybridization systems. 
Current systems require to engineer three plant lines - a male sterite line, a maintainer line, and 
a ferfility restores line. 

Rgure 13 depicts schematically the principle of the process of producing hybrid seeds 
according to the present invention. This system requires to design only one original parental 
line A Witt! pro-locus containing the parent heterologous nucleotide sequence of the invention. 
Line A may be herbicide resistant (H'') and male sterile (ms). allowing selection using the 
appropriate herbicide for the resistance trait employed. Splitting of said parent heterologous 
nucleotide sequence leads to line A1 and line A2 containing said first and said second 
heterologous nucleotide sequence, respectively. Lines A1 and A2 are therefore male fertile and 
herbicide sensitive (H^). Lines A1 and A2 may be maintained by selfing. Crossing of line A1 and 
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line A2 leads to the male sterile and herbicide resistant line A1/A2 of the invention, whereby 
self-progeny of line A1 and self-progeny of line A2 may be eliminated using said herbicide 
resistance. Crossing of line A1/A2 with a line B that may be a wilde-tpye ( WT) line leads to 
seeds (F1 progeny) growing on A1/A2 plants. When said F1 progeny seeds are sewed. F1 
plants growing therefrom will show hybrid vigor. 

Figure 14 A-D shows steps of the process of producing hybrid seeds according to the invention. 
A - scheme of creating lines A, and A^ with iso-loci from parental line A having a pro-locus 
containing the parent heterologous nucleotide sequence depicted at the top. Treatment of line 
A with reoomblnase A1 removes a part of the parent heterologous nucleotide sequence 
containing fragments HR5' and MS5'. thus fonning line A1. Treatment of line A with 
reoomblnase A2 removes a part of the parent heterologous nucleotide sequence containing 
fragments HR3' and MS3', thus fonning line A2. 

All the gene fragments may be designed as translational fusions with intein fragments capable 
of trans-splicing. Filled and dotted triangles show the recombination sites recognised by 
difierent site-specffic recomblnases. 

SM - selectable marken HR 3' - 3' fragment of gene conferring herbicide-resistance; HR 5' - 
5' fragment of the gene confening heriDicide resistance; MS 3' - 3' fragment of the gene 
providing for male sterility; MS 5' - 5' fragment of the gene providing for the male sterility. 
B - creation of male sterile line (at the bottom In the middle) by crossing line A1 and line A2. 
Self-progeny of line A1 (left picture at the bottom) and self-progeny of line A2 (right picture at 
the bottom) can be eliminated due to he*icide sensitivity, allowing pure stands of the male 
sterile hertjiclde resistant line A1/A2 (at the bottom in the middle). 

C - production of hybrid seeds by crossing line A1/A2 (line A1XA2). All progeny is hert>icide 
sensitive and male sterile. Cross progeny shows hybrid vigor, whereas self-progeny of line B 
does not Self-progeny seeds growing on plants of line B may be separated from cross-progeny 
seeds growing one line A1/A2 by harvesting them separately. 

D - shows production of hybrid seeds providing for F2 progeny with controlled seed 
gemiination. A3 locus provides for controlling the seed gennination once activated (A3*) by 
activator provided by A4 and B4 loci. 

Rgure 15 depicts a possible approaches to generate iso-locl . SM - selectable marker. Riled 
and dotted triangles show the recombination sites recognised by different site^pecific 
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recombinases. 

Figure 16 depicts sciiematic representations of T-DNA regions of plasmids pICH12590 and 
PICH12600. 

Rgure 17 depicts a schematic representation of the T-DIMA regions of plasmids pICH12610 and 
PICH12650. 

Figure 18 depicts schematic representations of T-DNA regions for piasmlds plCH12830 and 
PICH12840. 

Figure 19 depicts a schematic rerepresentation of T-DNA regions of constructs plCH12910 
and PICH12950. 

Figure 20 depicts schematic representation of T-DNA regions of plasmids plCH12870, 
PICH13130 and plCH13160. 

Figure 21 depicts schematic representation of T-DNA regions of plasmids pICH 12960 and 
PICH12970. 

Figure 22 depicts pro-locus firam plCH12960 of line A (top) and splitting of the parent 
heterologous nucleotide sequence for generating iso-loci from a pro-locus of the T-DNA region 
of plCH12960. The pro-locus contains AttP and AttB recombination sites of an integrase. 
Application of the integrase leads to statistic removal of of one partof tiie pro-locus or the otiier 
part, tiius leading to line A1 and to line A2. Molecular analysis e.g. by PGR is typically be 
carried out for analysing the recombination result 
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DETAILED DESCRIPTION OF THE INVENTION 

In this invention, we propose to split the coding sequence of a transgene involved in a 
trait of interest Into two or more fragments that can be bound to each other on the protein level, 
notably by trans-splirang. Heterologous nucleotide sequences containing these fragments are 
Introduced into the genome of a host plant, preferably into homologous chromosomes, or in the 
genome and the plastome of a transgenic multi-cellular plant, by hybridising parent plants. 
Once transcribed and translated, the protein fragments can be assembled by protein trans- 
splicing, thus fbmiing a functional protein, notably a protein which can provide for the trait of 
interest Since the plant breeding process usually involves very specific parental crosses, 
managing said process of the Invention does not pose serious additional problems. Any 
undeslred. spontaneous cross between the transgenic plant of the invention and unwanted 
organisms effectively disassembles said trait, thus abolishing expression and greatly reducing 
the chance of funcBona! gene transfer to illicit progeny. 

The processes of the invention allow to build mechanisms that would control either the 
expression of the transgene per se or It could be utilized to control the transgenic variety, as the 
progeny of any Illicit cross Is rendered non-viable. Both of these possibilities are inter alia 
contemplated in our Invention. 

The Invention also allows one skilled in the art to design schemes for selecting primary 
transfbmiants based on a selectable marker that is effective and operable in the To progeny, 
but fragments or alleles of which, upon subsequent crosses, segregate to different transgenic 
progeny and ttius disappear as a functional selectable mari<er gene. 

Furthennore. the Invention allows rapid in vivo assembly of different genes by crossing 
parents that contain different fragments of a transcriptional unit of interest, thus allowing to 
swap different functional domains, such as translational enhancers, transit or signal or targeting 
peptides, purification tags, different functional domains of proteins, etc., by simply crossing 
plants carrying desired fragments of such a functional gene. 

There Is a description of a hybrid seeds production system based on bamase gene 
fragments. If saW fragments are expressed In the same cell (anther cells), the protein fragments 
produced associate, whereby bamase activity is restored, generating male sterility 
(US63921 19; Burgess etal.. 2002, Plant J., 31. 113-125). Hybrid seeds produced with the help 
of said approach recover fertility due to the segregation of bamase gene fragments to different 
gametes, thus causing the inactivation of the cytotoxic gene responsible for male sterility. 
However, said system has serious limitations as It Is built on protein fragment interactions, not 
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trans-splicing. As the result said system is temperature-sensitive: temperatures higher than 
18°C may restore fertility of the male-sterile line by dissociating the bamase protein fragments. 
In the present invention, protein binding and/or trans-splicing can be achieved by using 
engineered inteins. Intelns were first identified as protein sequences embedded In-frame within 
protein precursor and excised during protein maturation process (Perier et al, 1994, Nucleic 
Acids Res., 22, 1125-1127; Perier. F.B., 1998. Cell, 92, 1-4). All infomiation and catalytic 
groups necessary to perform a self-splicing reaction reside in the intein and two flanking amino 
acids. The chemical mechanism of protein splicing is described In detail by Perier and 
colleagues (1997, Cu/r. Op/n. Chem, Biol., i 292-299) and by Shao & Kent (1997. Chem. BioL, 
4, 187-194). Inteins usually consist of N- and C-tennlnal splicing regions and central homing 
endonuclease region or small linker region. Over 100 inteins are known so far that are 
distiibuted among tiie nuclear and organellar genomes of different organisms Including 
eukaryotes, archaebacteria and eubacteria (http://www.neb.com/neb/inteins.html). It was shown 
that intein molecules are capable of trans-splicing. The removal of the central homing 
endonuclease region does not have any effect on intein self-splicing. This also made possible 
the design of trans-splicing systems, in which the N-temiinal and C-temninal fragments of intein 
are co-expressed as separate fragments and, when fused to exteins (protein fragments, being 
ligated togettier witii ttie help of intein), can perfonn trans-splicing in vivo (Shingledecker et al., 
1 998, Gene, 207, 1 87-1 95). It was also demonstrated witii N- and C- temninal segments of Uie 
Mycobacterium tuberculosis RecA intein, that protein trans-splicing could take place in vitro 
(Mills et al., 1998. Proc. Nati Acad. Sci, USA, £5, 3543-3548). This phenomenon was also 
identified for DnaE protein of Synechocysfis sp. strain PCC6803 (Wu a/., 1998. Proc, Natl. 
Acad. ScL USA, 95, 9226-9231). Two different genes located more ttian 700 Kb.p. apart on 
opposite DNA strands encode ttiis protein. It was also shown that two intein sequences 
encoded by those genes reconstitute a split mlni-intein and are able to mediate protein trans- 
splicing activity when tested in Esherichia coli cells. The intein molecule of ttie same origin 
(DnaE intein from Synechocystis sp. strain PCC6803) was used to produce functional 
heri^icide-resistant acetolactate syntiiase II from two unlinked fragments (Sun etal., 2001 , AppL 
Environ. Microbiol., gZ, 1025-29) and 5-enolpyruvylshikimate-3-phosphate synttiase (EPSPS) 
(Chen et al., 2001, Gene, 263, 39-48) in E. coli. 

The general principle of intein-mediated trans-splicing is shown in Fig.1. 

Yet another well established application of inteins is tiieir use for intein-based protein 
purification systems (for short review see Amitai & Pietrokovski (1 999, Nature BiotechnoL, 1 7, 
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854-855). The setf cleavage of intein from its extein releases extein as free protein molecule 
after the expose to either pH (Wood et al., 1999, Nature BiotechnoL. 12. 889-892) or 
temperature (Sourthworth et al., 1999. Biotechniques. 27. 110-114) changes. Altematively. 
nucleophilic agents (e.g. DTT) also initiate cleavage, but such agent remains covalentiy linked 
to the released protein (Klabunde et al., 1 998, Nat. Struct Biol., 5, 31-37). To the best of our 
knowledge, there is no prior art describing the use of intein-mediated protein trans-splicing for 
assembly of useful traits in plant cells in a biologically safe and controllable way. The general 
scheme of trans-«pliclng mediated trait assembly in progeny is shown in Figure 2A. None of 
two parental lines (P, and Pa) has a fully functional linear gene encoding said trait. In contrast, 
each contains fragments (A or B) of said gene preferably located on homologous 
chromosomes. As a result of hybridization between P, and P^, a progeny is generated that 
provides for a functional trait due to trans-splicing mediated assembly of proteins encoded by 
fragments A and B. tt is evident from said Figure, that only one fourth of S, progeny derived 
from self-pollination of the primary hybrid will retain the trait of interest, while the other half will 
inherit only one out of the two fragments required for providing said trait, and one fourth will 
have neither A or B. It is also evident, that cross-pollination with any other plant (illicit cross) 
having none of the fragments A and B will not lead to transmission of the trait, as only one of 
the two fragments necessary for functional gene is transmitted to each progeny plant 

There are several developed approaches and engineered inteins. which can be used to 
pracHce this Invention (references cited above). They actually cover the use of all known types 
of inteins in order to engineer trans-splicing events in eukaryotic cells. In EXAMPLE 3 we 
describe intein-mediated interaction, which brings together two domains of EPSP synthase 
providing for heri^icide resistance. It demonstrates the possibility of assembling a functional 
protein dimer by bringing together domains necessary for function without actually protein 
trans-spllcing taking place. Such intein-mediated protein-protein interactions also offer an 
alternative in some specific cases to provide for a trait without protein trans-splicing. 

The processes of the invention may be used as a convenient way of assembling a 
desired sequence and/or expression unit from different parts in trans, using as modules or 
building blocks different transgenic plants. Their hybrid progeny would put together modules 
inherited from different parents through engineered intein-mediated trans-splicing. It is possible 
to form a trait of interest by choosing the appropriate pair of transgenic parents containing 
required modules, very much like by choosing an appropriate pair of parental plants for 
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producing high value hybrid seeds in traditional breeding. Examples of such modules include 
different signal peptides, binding domains (e.g. cellulose, pectin, starch binding domains, etc.). 
retention signals, comparlmentalization signals, activation domains, domains with enzymatic 
activities, affinity tags, regulatory sequences, different genes of interest and parts thereof. In 
this regard the trait of interest is understood broadly and Includes not only a functional protein 
with a specific capabilities but in particular a protein targeted to a specific compartment or 

macromolecular matrix, or protein engineered for subsequent isolation/purification. 

Additionally, trans-splicing on protein level gives many Important advantages which 

cannot be provided by RNA trans^plicing. Said advantages are the result of the following 

features: 

a) intein-mediated trans-splicing directly results in the protein molecule, while ribozyme- 
medlated trans-splicing fomis RNA molecule, which, in most cases, shall be translated 
into the protein, thus restricting the choice of cellular/intercellular compartment for said 
trans-splicing; 

b) targeting of intein-mediated trans-splicing components provides for a lot of flexibility, as 
we are dealing with protein molecules, while targeting of RNA molecules is preferably 
restricted to cytosol; 

c) engineered inteins. in addition to said above, allow for regulating trans-splicing by 
changing pH. temperature or nucleophllic agents. 

d) Inteins engineered for trans-splicing interact with high efficiency and can bring together 
protein domains that will provide for enzymatic activity following such interaction even 
witiiout the covalent link (trans-splicing) taking place. 

Such diversity in the choice of parameters for regulation of Intein-mediated trans^splicing 
or interaction (combination of compartmentalization choices with modulation of abiotic 
parameters) gives flexibility and remarirable variability of choices compared to the RNA-trans- 
splicing approach. 

However, all these potential applications have a limited value without knowing how to 
achieve the most preferable location of said heterologous fragments relative to each ottier. 
preferably on nuclear chromosomes. 

According to this Invention, said fragments are on different homologous chromosomes 
(Rg. 2B. case III and IV). In case 111. self-progeny can Inherit the trait, but said trait will not be 
inherited by progeny resulting from crossing witti plants possessing neither of said fragments 
If melotic crossing-over Is neglected or absent. Melotic recombination between the two 
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homologous chromosomes may, however, physically link said fragments A and B. The 
frequency of such recombination events directly depends from the relative distance between 
said fragments on the two homologous chromosomes. 

In order to suppress physical linkage of said fragments by meiotic recombination, said 
fragments are preferably positioned at short relative distance on homologous chromosomes or, 
most preferably, at the same locus on the homologous chromosomes (Fig. 2B. case IV). thus 
minimizing the frequency of meiotic recombination between such fragments practically to zero. 
There are different technical solutions to achieve this most preferable allelic location of said 
fragments. Said fragments can be integrated at the same locus in pre-englneered Integration 
site by means of site-specific recombination (Fig. 2C). Examples of such systems Include the 
Cre-Lox system from bacteriophage P1 (Austin et a/., 1981. Cell, 25. 729-736). the Flp-Frt 
system from Sacchammyces cerevlsiae (Broach et al., 1982. Cell, 2g. 227-234). the R-RS 
system from Zygosacchammyces /ot/x//(Araki et al., 1985. J. Mol. Biol., 182. 191-203) and the 
integrase from the Streptomyces phage PhlC31 (Thorpe & Smitii, 1998, Pmc. Natl. Acad. Sd., 
§5, 5505-5510; Grotti etal., 2000. Proc. Natl. Acad. ScL, SI. 5995-6000). 

Said ftegments A and B can be integrated within chromosomal DNA as one construct 
AB (Fig. 2D). The design of the construct should allow selective removal, of one of the DNA 
fragments (A or B) using mechanism of controllable DNA rearrangement (excision or 
transposition), thus generating progeny containing eHher fragment A or fragment B In the same 
tocus. bringing togettier both fragments or their transcripts by crossing plants possessing only 
one of said fragments or both fragments, but only one of required transcripts, will lead to 
expressing a trait of Interest 

An example of said controlled DNA rearrangement is to flank firagments A and B wltii 
sequences recognized by different site-specific recombinases and. upon provision of flie 
respective recombinase. to selectively remove elflier fragment A or fragment B. Alternatively, 
ttie placement of a transcription initiation region (promoter) flanked by Inverted recombination 
srtes just between said fragments can lead to selective transcription of eittier fragment A or B 
depending on said transcription initiation region orientation. The inversion of orientation (but not 
excision) of said region can be induced by exposure to ttie recombinase source. As ttie result, 
it is possible to achieve selective transcription of elttier fragment A or fragment B witiiout 
(physically) removing them, but using DNA inversion as a switch. However, tiie case of 
selective expression of one heterologous DNA fragment in the presence of botii heterologous 
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DMA sequences at the same location is not among the preferred embodiments of this invention, 
as this may not provide the required level of control over trait expression and movement. 

An important embodiment of said controlled DNA reanangement comprises the use of 
transposition, wherein one of the fragments, for example fragment B, is located and transcribed 
within a non-autonomous transposable element, and its excision from the construct will trigger 
transcription of fragment A. Excision of the transposon may or may not be followed by its 
reinsertion elsewhere and progeny can be selected that contains fragment A or B only. Taking 
into the consideration that most of transposon reinsertions occur at positions closely linked to 
the donor site (Jones et ai. 1990. Plant Cell. 2. 701-707; Carroll et ai. 1995. Genelics. J39, 
407-420). the chance of selecting progeny containing fragments A and B linked in repulsion 
(on different chromosomes of chromosome pair) is very high. Figure 2E summarizes a variety 
of approaches for achieving an allelic location of the first and the second heterologous 
nucleotide sequences including site-targeted integration and excision of said fragments. 
Transposase-mediated or recombinase-mediated excision of said flragments can be achieved 
with the help of one (Fig. 2E. a/. aV, d/ and d'/) or two different transposon or recombinase 
systems (Fig. 2E. b/. bV, d, cV. e/. eV. f/ and f/). The use of two different systems Is preferred. 
The use of two different transposon sytems is more preferred. The use of two different 
transposon systems with overiapping transposon ends Is the most prefenred (c/ and c'/) 
embodiment. 

The description of construct design for trait assembly through intein-mediated protein 
trans-splicing (Figures 3. 4) or Intein-mediated protein fragment Interaction (Fig. 5) is described 
in examples 1. 2 and 3. respectively. The use of site-specific recombination or transposition 
allows positioning of the first and the second heterologous nucleotide sequence from a 
construct at the same loci on homologous chromosomes, which Is most favorable for controlled 
distribution of a trait of interest to cross-progeny. A schematic representation of such a 
construct (in the T-DNA of a vector for Agrobacterium-mediated transfomiation) and excision of 
one or ttie other of said heterologous nucleotide sequences wtth the use of two different plant 
transposon systems {Spm/dSpm and Ac^s) is shown in Figure 6. Here, tine two components 
(heterologous nucleotide sequences) of Intein trans-splicing system are located on ttie same T- 
DNA (Fig.6 A), but flanked by different transposon ends (Ds or dSpm) recognized by different 
transposases. Ac or Spm. respectively. In brief, the construct in the T-DNA has two non- 
autonomous transposabte etements witti overiap at one end. The exposure of a plant or of plant 
cells carrying such construct to a Spm or Ac transposase source, will lead to tiie excision of tiie 
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fragment flanked by the Ds sequences (exposure to Ac transposase), or of the fragment 
flanked by the dSpm sequences (exposure to Spm transposase). The resulting T-DNA 
structures are shown in Fig. 6. B and C. respectively. These resulting constructs are stable 
even in the presence of Spm (in case of B) or Ac (in case of C) transposase, as one of the two 
ends of the non-autonomous transposon required for transposition, is excised together with the 
other non-autonomous transposable element. Such stabilization of the remaining transposable 
element is very useful, especially in the case of plants carrying an endogenous transposase 
source, e.g. com in case of Ac or Spm transposase. The scheme of transposon-based 
selective removal of unwanted DNA fragments is shown in Rg. 7. Here, transposition also 
leads to removal of other unwanted sequences, e.g. a selectable (SM) and a counter-selectable 
marker (CSM) genes, thus facilitating the screening for plants/plant cells carrying only the 
required heterologous nucleotide sequence (hDNA 1 or 2). One of the possible schemes for 
generating plants with different heterologous nucleotide sequences in allelic relation Is shown 
in Rgure 8. These examples with selectable marker genes is not limited to genes conferring 
antibiotic or herbicide resistance. An extensive list of such genes Is shown below. Examples of 
some counter-selectable marker genes applicable to plant systems, bacterial codA and 
cytochrome P-450 (Kopreck et al.. 1999. Plant J., 19. 719-726; Gallego etal., 1999. Plant Mol 
Biol.. 39. 83-93). are described in a number of papers, including their application in combination 
with transposon systems (Tissier et al.. 1 999. Plant Cell. 11, 1 841-1 852). 

There are well studied transposon systems for plants that are abundantly described in 
the literature (for reviews see: Dean et al.. 1991 . Symp Soc Exp Biol., 45. 63-75. Walbot. V.. 
2000. Plant Cell Physiol.. 41. 733-742; Fedoroff. N.. 2000. Proc Natl Acad Sci U S A.. SL. 7002- 
7007). The Ac/Ds (Briza etal.. 1995. Genetics. 141. 383-390; Rommens et al.. 1992. Plant Mol 
Biol.m. 61-70; Sundaresan etal.. 1995.Genes Dev.. 3.1797-810; Takuml. S. 1996. Genome, 
3a. 1169-1175: Nakagava et al., 2000. Plant Cell Physiol.. 41. 733-742) and Spm/dSpm 
(Garden et al.. 1993. Plant J. 3. :773-784; Aarts et al.. 1995. Mol Gen Genet. 24Z. 5555-64; 
Tissier et al.. 1999. Plant Cell, IL 1841-1852) systems are well established for transposon 
tagging in many plant species including many crop plants, and their adoption for practicing this 
invention Is routine for those familiar witii the art. This Invention is not limited to Ac/Ds and 
Spm/dSpm systems. Actually, any transposon system active in plants employing a "cut-and- 
paste- (excision and reinsertion) mechanism for Its transposition can be employed in tills 
invention. 

In the examples, we used Agrofeacferfu/iHnedlated T-DNA delivery in plant cells. 



I 
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whereby said T-DNA contains said first and/or said second heteologous nucleotide sequence 
as a vector. Different methods may be used for the delivery of vectors into plant cells such as 
direct introduction of said vector into cells by means of microprojectile bombardment, 
eleclroporation or PEG-mediated transfomiation of protoplasts. /\groi>acfe/7t/m-mediated plant 
transformation is prefen^d. Thus. DNA may be transfomied into plant cells by various suitable 
technologies such as by a Ti-plasmid vector canied by Agrobacterium (US 5.591.616; US 
4.940.838; US 5,464.763), particle or microprojectile bombardment (US 05100792; EP 
00444882B1; EP 00434616B1). In principle. oVner plant transfomiation methods can also be 
used e.g. microinjection (WO 09209696; WO 09400583A1; EP 175966B1). electixjporation 
(EP00564595B1; EP00290395B1 ; WO 08706614A1). etc. The choice of ttie transformation 
mettiod depends on the plant species to be transfomied. For example, microprojectile 
bombattlment may be preferred for monocots transfomiation. while for dloots. Agrobacterium. 
mediated transfomiation gives generally better results. 

The trans-splicing system described in our invention comprises two fragments, which 
are provided in bans and are located in allelic positions on homologous chromosomes. This 
means ttiat our system is better controlled and safer, e.g. it can have zero trait expression level 
in the uninduced state and zero trait transfer during cross-pollination wifli other plants. 

Genes of interest, or fragments tiiereof. ttiat can be expressed. In sense or antisense 
orientation, using ttiis invention include: staroh modifying enzymes (starch synttiase. starch 
phosphorylation enzyme, debranching enzyme, starch branching enzyme, staroh branching 
enzyme II. granule bound starch synthase), sucrose phosphate synthase, sucrose 
phosphorylase. polygalacturonase, polyfructan sucrase. ADP glucose pyrophosphorylase. 
cydodextrin glycosylti^nsferase. foactosyl tiansferase. glycogen synttiase. pectin esterase, 
aprotinin. avidin. bacterial levansucrase. E.co// gIgA protein, MAPK4 and orthologues. nitrogen 
assimilation/methabolism enzyme, glutamine synttiase. plant osmotin, 28 albumin, ttiaumatn, 
site-specific rocombinasefintegrase (FLP. Cre. R recombinase, Int, SSVI Integrase R. Integrase 
phiC31. or an active fragment or variant ttiereof). Isopentenyl transferase. Sea M5 (soybean 
calmodulin), coleopteran type toxin or an Insecticidally active fragment, ubiquitin conjugating 
enzyme (E2) fusion proteins, enzymes that metabolise lipids, amino acids, sugars, nude.c 
acids and polysaccharides, superoxide dismutase. Inactive proenzyme fomi of a protease, 
plant protein toxins, traits altering fiber In fiber producing plants. Coleopteran active toxin from 
BacHlus thurhglensis (Bt2 toxin, insecticidal crystal protein (ICP). CrylC toxin, delta endotoxin, 
polyopeptide toxin, protoxin etc.). insect specific toxin AalT. cellulose degrading enzymes. E1 
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cellulase from Acidothermus celluloticus. lignin modifying enzymes, cinnamoyl aicoliol 
dehydrogenase, trehalose-6-phosphate synthase, enzymes of cytoldnin metabolic pathway. 
HIVIG-CoA reductase, E. coli inorganic pyrophosphatase, seed storage protein, Erwinia 
herbicola lycopen synthase. ACC oxidase, pTOM36 encoded protein, phytase. ketohydrolase. 
acetoacetyl CoA reductase, PHB (polyhydroxybutanoate) synthase, acyl carrier protein, napin, 
EA9, non-higher plant phytoene synthase, pTOM5 encoded protein. ETR (ethylene receptor), 
plastidic pymvate phosphate dil<lnase. nematode-inducible transmembrane pore protein, trait 
enhancing photosynthetic or piastid function of the plant cell, stilbene synthase, an enzyme 
capable of hydroxylatlng phenols, catechol dioxygenase. catechol 2.3-dioxygenase, 
chloromuconate cycloisomerase. anthranilate synthase. Brassica AGL15 protein, fmctose 1.6- 
biphosphatase (FBPase). AMV RNA3. PVY replicase. PLRV repllcase. potyvirus coat protein. 
CMV coat protein. TMV coat protein, luteovims replicase, MDMV messenger RNA, mutant 
geminiviral replicase. Umbellularia califomica C12:0 preferring acyl-ACP thioesterase. plant 
C10 orC12:0 preferring acyl-ACP thioesterase. C14:0 preferring acyl-ACP thioesterase (luxD). 
plant synthase factor A, plant synthase factor B, 6-desaturase. protein having an enzymatic 
activity In the perxjxysomal -oxidation of fatty acids in plant cells. acyl-CoA oxidase. 3-ketoacyl- 
CoA thiolase. lipase, maize acetyl-CoA-carboxylase. 5-enolpyruvylshlkimate-3-phosphate 
synthase (EPSP). phosphinothricin acetyl transferase (BAR. PAT), CP4 protein, ACG 
deaminase, ribozyme. protein having posttranslational cleavage site, protein fusion consisting 
of a DNA-binding domain of Gal4 transcriptional activator and a transcriptional activation 
domain, a translational fusion of oleosin protein with protein of Interest capable of targeting the 
fusion protein into the lipid phase, DHPS gene conferring sulfonamide resistance, bacterial 
nitrilase. 2,4-D monooxygenase. acetolactate synthase or acetohydroxyacid synttiase (ALS. 
AHAS), polygalacturonase, bacterial nitrilase. fusion of amino terminal hydrophobic region of 
a mature phosphate translocator protein residing In the inner envelope membrane of the piastid 
with protein of interest to be targeted into said membrane etc. 

Any human or animal protein can be expressed, notably In hybrid seeds and plants 
grown therefrom, using the trBPS-splidng system of the Invention. Examples of such proteins of 
Interest include inter alia the following proteins (phannaceutical proteins): immune response 
proteins (monoclonal antibodies, single chain antibodies. T cell receptors etc.), antigens, colony 
stimulating factors, relaxins. polypeptide homiones. cytokines and their receptors, interferons, 
growtii factors and coagulation factors, enzymatically active lysosomal enzyme, fibrinolytic 
polypeptides, blood clotting factors, trypsinogen. 1-antitiypsln (AAT). as well as function- 
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conservative proteins iilce fusions, mutant versions and synthetic derivatives of the above 
proteins. 

The process of the invention may further comprise expressing a gene encoding a post- 
transcriptionai gene silencing (PTGS) suppressor protein or a function-conservative variant or 
fragment ttiereof in a plant for suppressing PTGS of said transgenic coding sequence. Said 
PTGS suppressor protein gene or functlon-conservalive variant or fragment thereof may be 
provided to a plant on the same vector canying said transgenic coding sequence or on an extra 
vector. Said PTGS suppressor protein is preferably of viral or plant origin. Examples of PTGS 
suppressor proteins are potato virus X p25 protein, african cassava mosaic virus AC2 protein, 
rice yellow mottle vims P1 protein, tomato bushy stunt virus 1 9K protein, rgs CAM or a function- 
conservative variant or fragment of one of these proteins. Said function-conservative variant or 
fragment preferably has a sequence identity of 75%, preferably at least 75%. to one of the 
above proteins. Details on PTGS suppressor proteins and their use can be found in 
WO0138512. 

The Invention further provides a transgenic multi-cellular plant organism expressing a 
trait of interest, said organism having a controlled distribution of said trait to progeny, wherein 
expression of said trait involves production of a protein molecule by trans-splidng of polypetlde 
fragments, whereby said polypeptide fragments are encoded on different heterologous 
nucleotide sequences. Said different heterologous nucleotide molecules are incorporated on 
homologous chromosomes of this plant Preferably, said polypeptides form, after trans-splldng 
or other specific polypeptide interaction, a heterologous protein. 

The invention further comprises parts or products of the transgenic plant organisms of 
the invention and plant seeds obtained by said hybridising. Further, plants or plant material 
(notably seeds or cell thereof) obtained or obtainable according to step (I) or step (ii) of claim 1 . 
Moreover, vectors for performing the process of the invention are provided, whereby said 
vectors comprise the parent heterologous nucleotide sequence as defined herein. Further, 
vectors for performing the process of the invention are provided, notably those shown in the 
figures and those used in the examples of the invention. 

In summary, we propose trait/gene lock systems that are based on a modular principle 
of providing for trait by assembly of non-functional protein fragments or sub-units into a 
functional protein. Such systems rely on genetic control of the trait of Interest by at leasttwo loci 
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that segregate independently during crosses, especially during illicit sexual crosses or In the 
process of a hypothetical horizontal transfer. Based on the present invention, such locks rely on 
functional protein assembly when the necessary loci are present and expressed in the same 
cell or in the same organism. Based on our invention, such gain of function is preferably 
achieved through protein trans-splicing. It was shown before, that intein-mediated trans-splicing 
allows for functional protein assembly from non-functional protein fragments in vitro (Mills et al.. 
1998. Proc. Natl. Acad. Sci. USA, 95. 3543-3548). as well as in different microorganisms 
(Shingledecker et al.. 1998. Gene. 2QZ. 187-195; Wu ef al., 1998. Proc. Natl. Acad. Sci. USA, 
§5. 9226-9231; Sun et al., 2001. Appl. Environ. Microbiol., 6Z. 1025-29; Chen ef al., 2001. 
Gene, 263. 39-48). The present invention, however, is not limited to protein trans-splicing as 
the mechanism of functional protein assembly. Such functionality leading to a trait of interest 
can be achieved also by providing different subunits of a heteromeric protein, as long as (a) Vne 
functionality of the protein of interest depends on the presence of the subunits In questions and 
(b) the genes for components of are encoded in such a way as to allow for preventing illicit 
crosses. 

The invention also allows to assemble sequence coding for a protein from modules of 
e.g. signal peptides, binding domains, retention signals, trans-mambrane signals, activation 
domains, domains with enzymatic activities. afRnity tags, and regulatory sequences. Such a 
modular approach makes it simple to find an optimal expression cassette fora specific purpose 
or for finding an optimal secretory or transit peptide for a specific gene to be over-expressed 
and accumulated In the cell or a specific compartment thereof. It can be a valuable tool for 
functional genomics and proteomics studies. A library of plants may e.g. be created, whereby 
each member of the library contains a particular module (e.g. a specific signal peptide) of one 
of tiie above module classes e.g. as said first fragment The second fragment will then code for 
a protein of Interest. Following said hybridizing, the sequence of said protein is linked to said 

module by trans-splicing. 

Protein splicing, can occur only between at least two genetically designed loci, it occurs 
in vitro with a very high efficiency, thus allowing for quantitative splicing of parental 
polypeptides, and It can occur between polypeptides that are encoded in different organellar 
genomes, such as nuclear genome, plastid or mitochondrial genomes, or extrachromosomal 
episomes. as long as the franslated polypeptWes are targeted to the same organelle. 
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it should be mentioned that the technology described herein can similariy be applied to 
multl-cellular animals. Humans are excluded. 



EXAMPLES 
EXAMPLE 1 

|nteln-mediated trans-s pHrana of GFP 

The 5- end of the GFP gene was amplified by PGR using primers 35spr3 (cgc aca ate cca da 
tec ttc g) and gfpprS (ctg ctt gtc ggc cat gat ata g) from plasmld plCH5290 (35S^mega leader- 
gfp coding sequence-ocs temilnator in Icon Genetics binary vector pICBVI containing BAR for 
plant selection. Fig. 9). A DNA fragment containing the Otemiinal end of the DNAE inleln finom 
Synechocystis was amplified by PGR from genomic DNA (Strain PCC6803 from the American 
Type Culture Center) using primers gfpinte! (eta tat cat ggcc gac aag cag aag ttt gcgg aatat 
tgcc tcagt) and intepr2 (ttt gga tec tta ttt aat tgt ccc age gtc aag). A fusion of the GFP and Intein 
fragments was made by PGR using previously amplified DNA fragments and primers 35Spr3 
and intepr2 for the second amplification. The PGR product was cloned as a Noo1-BamHI 
fragment in plCBV16 (Icon Genetics binary vector with Nptll for plant selection Hg. 11; other 
bindan^ vectors may also be used). The resulting plasmld. pICSgfpInt (Fig 3). was checked by 
sequencing. 

The 3- end of the GFP gene was amplified by PGR using primers gipprQ (aag aac ggc ate aag 
gtg aac) and nosterrev (tea teg caa gac egg caa cag g) from plasmld plGH5290. A DNA 
fragment containing the N-temiinal end of the DNAE intein from Synechocystis was amplified 
by PGR from genomic DNA using primers inteprS (ttt cca tgg tta aag tta teg gtc gtc) and 
integfp2 (gtt cae ctt gat gee gtl ctt aca att ggc ggc gat cgc eec att). A fusion of the Intein and 
GFP fragments was made by PGR using previously amplified DNA flragments and primere 
inteprS and nosferrevfbr the second amplification. The PGR product was cloned as a Nco1- 
BamHI fragment In plCBV16. The resulting plasmld. plGintgfp3' (Fig. 3) was checked by 
sequencing. 

The GFP gene product that results from Inteln-medlated transplicing contains sbc additional 
amino acids (KFAEYG) between aminoaelds 1 56 (K) and 157 (Q). This insertion was shown to 
not slgnificanUy affect GFP fluorescence (Ozawa et al., 2001 . Anal. Cham.. 22. 5866-5874). 
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pICS'gfpint and plCintgfp3' were transformed in agrobacterium strain GV3101 by 
electroporation. Botfi constructs were co-expressed transiently in Nicotiana benthamiana 
leaves using Agmbacterium-mediaied transient expression (Vaquero et al., 1999, Proc. Natl. 
Acad. Sci. USA, 96, 1 1 1 28-1 1 1 33). GFP fluorescence was detected when both constructs were 
co-expressed but not when constnjcts were expressed indlviduaily. 

Both constoicts were also transformed in Arabidopsis thaliana (Col-0) plants as described by 
Bentefa/. (1994, Sc/ence, 285.1856-1860). Seeds were harvested three weeks after vacuum- 
infiltration, and genninated and screened for transfonnants on plates containing 50 mg/L 
Kanamycin. The same constmcts were also used for Agrobacterium-med\a\Bd leaf disc 
transformation of Nicotiana tabacum plants (Horsh et al., 1985, Science, 221* 1229-1231) 
using 50 mg/L of Kanamycin for selection of transfonnants. In tobacco arKl Arabidopsis, GFP 
fluorescence could not be detected in transfonnants with either construct alone, but was 
detected in plants containing both transgenes. 



EXAMPLE 2 

Intein-mediated trans-s plicina of EPSP 

The enzyme 5-enolpyruvylshiklmate 3-phosphate synthase (EPSP synthase) catalyses the 
fonnation of 5-enolpyruvylshlkimate 3-phosphate from phosphoenolpyruvate and shikimate 3- 
phosphate. EPSP synthase Is the cellular target of the herbicide glyphosate (N- 
phosphonomethylglyclne). A mutant allele of the aroA gene of Salmonella typhimurium with a 
PI 01 S mutation encodes an EPSP synthase with decreased activity to glyphosate. Expression 
of this gene In plants confere resistance to glyphosate (Comal et al. 1985. Nature. SIZ. 741- 
744). The 5* end of the mutant EPSP gene was amplified by PGR from Salmonella typhimurium 
genomic DNA (prepared from strain ATCC 39256 from the American Type Culture Center) 
using primers epspl (tctcc atg gaa tec ctg acg tta caa c) and epsppr2 (acc tgg aga gtg ata ctg 
ttg). A DNAfragment containing the C-tennlnal end of the DNAE Intein from Synechocystis was 
amplified by PCR from plC5'gfplnt using primers inteprS (caa cag tat cac tct cca ggt aag ttt gcg 
gaa tat tgc etc agt) and Intepr2. A fusion of the EPSP and Intein fragments was made by PCR 
using previously amplified DNA fragments and primers epspl and inteprZ. The PCR product 
was cloned as a Ncol-BamHI fragment In plCH5300 (Icon Genetics binary vector with BAR 
gene for plant selection. Fig. 10). The resulting plasmid. plC5'epsp-int (Rg 4). contains the 
EPSP-N-lntein fusion under control of the 35S promoter, fused translatlonally to an artificial 
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chlotxjplast transit peptide (massm Issaav vatra saaqa smvap ttglk saasf pvtrk qnnid itsia snggr 
vqca). plC5'epsp-int was checl<ed by sequencing. 

The 3' end of the EPSP gene was amplified by PCR from Salmonella typhimurium genomic 
DNA using primers epsp3 (cgc tat ctg gtc gag ggc gat) and epsp4 (egg ggatcc tta ggc agg cgt 
act cat tc). A DNA fragment containing the N-temninal end of the DNAE intein from 
Synechocystis was amplified by PCR from plCintgfp3' DNA using primers inteprS and Intepr6 
(ato gcc etc gac cag ata gcg gga ttt gtt aaa aca att ggc ggc gat). A fusion of the intein and 
EPSP fragments was made by PCR using previously amplified DNA fragments and primers 
inteprS and epsp4. The PCR product was cloned as a Nco1-BamHI fragment In plCH5300. 
The resulting plasmid. plCint-epsp3' (Fig 4). contains the C-intein-EPSP fusion under control of 
the 35S promoter, fused translationally to the artificial chloroplast transit peptide. plClnt-epsp3' 
was checl^ed by sequencing. 

The EPSP gene product that results from Inteln-mediated trans-spllcIng contains ten additional 
aminoacids (KFAEYCFNKS) between aminoadds 235 (G) and 236 (R). It has been previously 
shown that this position In the EPSP gene can accommodate insertions of at least 5 to 12 
amino acids without compromising gene function (Chen etal., 2001, Gene, 2§3. 39-48). 
plC5'epsp-int and plClnt-epsp3' were transfomied in agrobacterium strain GV3101 by 
electroporation. Both constaicts were transformed In Arabldopsis thaliana (Col-0) plants as 
described by Bent et al. (1994. Science. 2^ 1856-1860). Seeds were han^ested three weel« 
after vacuum-infiltration, gemilnated In soil and screened for transformants by spraying several 
times with a solution containing 50 mg/L phosphinothricin (PPT). 

The same constmcts were also used for /\grof)acfe/fu/THnediated leaf disc transfonnatlon of 
Nicotiana fabacum plants (Horshef a/., 1985. Sc/ence. 222L 1229-1 231) using lOmg/L of PPT 
for selection of transfonnants. Transfonnants wer© checked for EPSP gene activity by spraying 
plants with a commercial fomiulatton of glyphosate (N-phosphonomethylglycine). For both 
Arabldopsis and tobacco, transfonnants containing either constructs alone did not exhibit 
glyphosate resistance. F1 plants containing both constructs were resistant to glyphosate. 

EXAMPLE 3 

Intain-medigteH assembly nf ftinrfional EPSP without tranS-SPliOinq 

pICS-epsp-intM Is similar to construct plC5'epsp-int but differ at the junction EPSP-N intein by 
the addition of 4 native N extein amino acids instead of five and by the first N intein aminoacid 
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Which was changed from Cys to Ala. PIC5'epsp-intM was made following the same strategy as 
for PIC5'epsp-int except that primer intepr? (caa cag tat cac tct cca ggt ttt gcg gaa tat gcc etc 
agt ttt ggc ac) was used instead of primer inteprS. 

PlClnt-epspS'M is similar to construct PICint-epsp3' but differ at the junction C Intein-EPSP by 
the addition of 3 C extein amino acids instead of five, the first one mutated from Cys to Ala and 
the two other native, and by the last C intein aminoacld which was changed from Asn to Ala. 
PlCint-epspS'iy^ was made following the same strategy as for PICint-epsp3' except that primer 
inteprS (ate gcc etc gac cag ata gcg gtt aaa age age ggc ggc gat cgc ccc att g) was used 
instead of primer intepr6. 

The three mutated aminoaeids completely prevent Intein mediated transpliclng but do not 
prevent association of the N and C intein firagments ((Chen et al.. 2001. Gene, m 39-48). 
plC5'epsp-intM and plCint-epsp3'M were transfbmied in agrobaclerium strain GV3101 by 
electroporation. Both constructs were transfomied In Arabldopsis thaliana (Col-0) and tobacco 
as described above. Primary transfbmiants were all sensitive to glyphosate. but hybrid F1 
plants containing both constructs, either in tobacco pr Arabldopsis. exhibited glyphosate 
resistance. 

EXAMPLE 4 

Splitting the Arabido p^^'g AHAS gene 

The acetolaclate synthase (AHAS) gene from Arabldopsis (Genbank accession AY042819) was 
amplified from Arabldopsis genomic DNA using primers Alsl (5' taaaccatgg eggeggcaac 
aacaac 3') and Als2 (5' gactctagae cggtttcate telcagtatt taatc cggcc atctco 3') and cloned as an 
Nco1-Xba1 fragment In loon Genetics binary vector plCBV24 (Kan« selection in £.co// and 
Agrvbacterium). Ser653 was mutated to Asn by PCR using primers AlsmS (5' caggacaagt 
ctctcgtcgt atg 3'). Als4 (5' gaaagtgcca ccattcggga tcatcg 3'). Als3 (5' cgatgatccc gaatggtggc ac 
3') and Als2. The amplified mutated fragment was cloned as an Nhe1 - Age1 fragment A 
second aminoacld. Pro197 was mutated to Ser by PCR using primers Als1. AlsmS. Alsm6 (5' 
acgacgagag acttgteotg tg 3') and Alsprl. The amplified mutated fragment was subeloned as a 
Sapl-Mlul fragment 

The rice actini promoter was amplified by PCR from rice genomic DNA using primers 
Aotpri (5' atgggcgcgc cagatclgca tgccggtcga ggtcattcat atgettgag 3') and Actpr2 (5' cgceatggtt 
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tatcgatagcttatcgtcta cctacaaaaa agctccgcac g 3'). The PGR product was cloned upstream of 
the AHAS gene as an Ascl-Ncol fragment The resulting plasmid, plCH12690 (Fig. 16) contains 
the rice actini promoter followed by the Arabidopsis AHAS coding sequence with two mutated 
aminoacids, and the Nos temiinator. 

The mutated Ahas gene was split into two parts using the Synechocystis sp. PCC6803 
DnaE intein. To test a position for splitting AHAS, aminoacids RAEELLK (aminoacids 428 to 
434) were replaced by aminoacids DVKAYCFNKKG using PGR with primers AlsmS. Alsm4 (5' 
ggccatggtt aaaacaatat tccgcaaact tgacgtcgtt ctcaagaacc ttattcatcc 3'), Alsm3 (5' gcggaatatt 
gttttaacca tggccttgat tttggagttt ggagg 3') and Nostenrev (5* tcatcgcaag accggcaaca gg 3'). This 
substitution results in a protein that is similar to the protein that would be produced by intein- 
mediated trans-splicing of the constructs described below (plCH12610 and plCH12650. see 
Fig. 17). The mutated fragment was subcloned as a BspEI-Scal fragment The resulting 
plasmid, PIGH12600 (Fig. 16). was tested for AHAS activity by bombardment of Trtticum 
monococcum cell suspension cultures and selection on plates conlalning 0.5 to 3 microMolar 
sulfometuron methyl (Sigma). 

The Intein-N part of the DnaE intein was amplified by PGR from Synechocystis genomic 
DNA with primers IntNl (5' gcaagcttga cgtcaagttt gcggaatatt gcctcagt 3') and lntN3 (5" 
cgtctagagt cgacctgcag ttatttaatt gtcccagcgt caag 3'). and subcloned Into plCH12600 (Fig. 16) 
as a Aat2-Xba1 fragment The resulting plasmid. plCH12610 (Rg. 17). contains the N part of 
the AHAS gene fused to the inteln-N fragment 

A PGR fragment containing the Intein-C part of the DnaE intein was amplified from 
Synechocystis genomic DNA with primers Ctintel (5' ggtclagaatcgatggtlaaagttatcggtcgtcg 3') 
and /rrfC2 (5* cgccatggtt aaaacaattg gcggcgatcg c 3'). A PGR fragment containing an artificial 
chloroplast targeting signal (sequence: massmlssaa watrasaaq asmvapftgl ksaasfpvtr 
kqnnldltsi asnggn/qca) was amplified from plCH5300 with primers Spr3 (5' cgcacaatcc 
cadatcctl eg 3') and Ctinte2 (5* cttlaaccat agcgcatlga actcttcctc c 3'). A fragment containing a 
fusion chloroplast targeting signal-intein-C fragment was obtained by amplification from both 
fragments with primers Spr3 and lntC2. This fragment was cloned using Glal and Ncol Into 
PIGH12600 (Fig. 16). The resulting plasmid plGH12650 contains the fusion artificial chloroplast 
targeting signal- DnaE intein C-AHAS G firagment under control of the rice actini promoter, in 
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a binary vector. To test the functionality of the split AHAS gene, plCH12610 and piCH12650 
(Rg, 17) were co-bombarded into Triticum monococcum cell suspension cultures and the cells 
selected on media containing 0.5 to 3 microMolar sulfometuron methyl. 



EXAMPLE 5 

Spllttina the RARNASE oene 

The bamase gene was split using the Synechocystis sp. PCC6803 DnaB intein. 
fragments for the N and C tenninal parts of Bamase flanked by appropriate restriction 
were chemically synthesized by a commercial DNA-synthesls company. 



The sequence of the N terminal end Is: 

5- gcaatcgatg gcacaggtta tcaacacgtt tgacggggtt gcggattatc ttcagacata tcataagcta cctgataatt 
acattacaaa atcagaagca caagccctcg gctgggacgt ccgc 3' 

The sequence of the C terminal end Is: 

5- cgccatgggg tggcatcaaa agggaacctt gcagacgtcg ctccggggaa aagcatcggc ggagacatct 
tctcaaacag ggaaggcaaa ctcccgggca aaagcggacg aacatggcgt gaagcggata ttaactatac atcaggctlc 
agaaattcag accggattct ttactcaagc gactggctga tttacaaaac aacggaccat tatcagacct ttacaaaaat 
cagataagga tccgc 3'. 

The N terminal end of Bamase was fused to the N part of the DnaB Intein. The DnaB 
Intein-N fragment was amplified from Synechocystis DNA using primers DnaBintNprI (5' 
gtAAGCTTGA CGTcagagag agtggatgca tcagtggaga tag 3') and DnaBintNpr2{5- caCTGCAGct 
ataattgtaa agaggagctt tctag 3'). The Bamase fragment (a Clal Aatll fragment) and the intein 
fragment (a Aatll PstI fragment) were cloned in an Icon Genetics binary vector resulting In clone 
plCH12790. 

The C terminal end of Bamase was fused to the C part of the DnaB intein. The DnaB 
inteln-C fragment was amplified from Synechocystis DNA using primers dnaBintCprI (gt CTG 
GAG ATC GAT TCA TGA gcc cag aaa tag aaa agt tgt etc) and dnaBintCpr2 (tc AAG CTT CCA 
TGG tct tgc tct tea ctg tta tgg aca atg atg tea t). The Intein fragment (a Sad Ncol fragment) and 
the Bamase fragment (a Ncol BamHI fragment) were cloned in an Icon Genetics binary vector, 
resulting in clone plCH12820. 
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Functionality of the N and C temiinal Bamase-lntein fusion clones was tested by 
agroinfiltratlon of Nicotiana benthamiana leaves. As expected the Infiltrated sector became 
necrotic. 

To reduce activity of Bamase. a frameshift was introduced in the N part of the Bamase 
gene. A PCR was perfonned on plCH12790 with primers Bampr4 (5' gcaatcgatg gcacaggtta 
ttcaacacgt ttgac 3') that contains the framshifl. and BamprS (5' gcggacgtcc cagccgaggg ctlgtgc 
3-) and subcloned in plCH12790 resulting in plasmid plCH12800. The tapetum^pedfic 
promoter (Genbank Number D21160: Tsuchia ot al.. 1994. Plant Mol Biol.. 2S. 1737-1746) was 
amplified from rice genomic DNA using primers Tapprl (5' cggaattcgg cgcctttttt ttacacagtt 
caaagtgaat tttgg 3') and Tappr2 (5' cgcatcgatg cttaattagc tttggttaat tggag 3') and subcloned In 
PICH12800 as an EcoRI-CIal fragment, resulting in plasmid plCH12830 (Rg. 18). The rice 
tapetum-specific promoter was subcloned from plCH12830 (Rg. 18) Into plCH12820 as an 
EcoRI-Clal fragment. The resulting i^onstruct plCH12840 (Rg. 18) contains the intein<J- 
Bamase-C fusion under control of the rice tapetum^peciflc promoter. 



EXAIVIPLE 6 

Generation of pro-loci is nonstructs 



Assembly of all components required In the final construct was done in a stepwise 
fashion. Rtst a sequence containing an AtlP and an AttB site flanked by appropriate restriction 
sites was made from overlapping oligonucleotides and cloned In the Icon Genetics bina^r 
vector plCBV26(only contains Xhol-Clal-Xbal sites between T-DNA borders. Kan« selection .n 
E.CO// and Agmbacterium). The resulting sequence (agatctgtgc cccaactggg gtaacctttg agttctctc 
agttgggggc gtagggaatt ctgtctgcag tctagattta tgcatggcgc gcclatctcg agctcgaagc cgcggtgcgg 
gtgccagggc gtgcccttgg gctccccggg cgcgtactcc acctcaccca tcactagtig tggtaccatc gcagggccc) .s 
prosent In constnict plCH12920. The N-temiinal Bamase-lntein fragment (EcoRI-Pst1 fragment 
from PICH12830. Rg. 18). the Ahas-lntein fragment (AsclXhol fragment from plCH12610. Rg. 
17) and the Ocs terminator (an Xbal Pst1 fragment from plCH12900) were subcloned .n 
PICH12920. The resulting done plCH12950 (Rg. 19) contains both N-temilnal Bamase and 
Ahas fragments flanked by AttP and AttB sites in binary vector. 
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Next, a sequence containing an AttP site flanked by appropriate restriction sites was 
made from overlapping oligonucleotides and cloned in plCBV26. The resulting construct 
plCBV12850 contains the sequence (ggtacctgca gtattctaga ttcgaattct cgagtgtggc gcgccgtgcc 
ccaactgggg taacctttga gttctctcag ttgggggcgt agggccct) on the T-DNA. The C-termlnal inteln- 
Bamase fragment (an EcoRI-BamHI fragment from plCH1 2840. Fig. 1 8). tiie Ocs terminator (a 
BamHI-Pst1 fragment from plCH12900) and the C-temiinal inteinnAhas firagment (an AscI Xhol 
fragment from plCH12650) were subcloned in plCH12850. The resulting clone plCH12910 (Rg. 
19) contains both C-terminal Bamase and Ahas fragments and an AttP site In binary vector. 

C-tenninal and N-temiinal fragments were combined In one binary vector by subdonlng 
a Kpnl Apal fragment from plCH12910 into plCH12950 (Rg. 19). resulting In plCH12960 (Rg. 
21). 

Selectable marker for transfomnation: 

A HPT gene under control of ttie maize ublqultin promoter was used for plant 
transfomiation. To faciTrtate selection, an Intron was inserted Into HPTcodlng sequence. Rrst 
a target site for cloning was Inserted into the HPT coding sequence by amplifying a PCR 
fragment from plC052 (HPT coding sequence-Nos temninator in pUC19) wWh overlapping 
primers Bamhpt (5' cgggatccaa tcagatatga aaaagcctga ac 3'). Hptinti (5' ccacaactgt 
ggtctcaagg tgcttgacat tggggagttc ag 3'). Hpant2 (5' ggatatcggt ctcgtacctc cggaatcggg agcgcgg 
3') Sgfhpt (5- cgcagcgatc gcatccattg cctccgcgac cggctggaga acagcg 3*). and Inttarg (5' 
aggtacgaga ccgatatcca caactgtggt ctcaaggt 3'). and subdonlng the amplified fragment as a 
BamHI Sgfl fragment into plC052. An Intron was amplified firom petunia genomic DNA with 
primers IntpetS (5' gtclggtctc aggtaagttc tgcatttggt tatgctcctt gcattt 3') and Intpet4 (5' gtctggtctc 
tacctgtagc aataattaaa acaaaaata 3') and cloned as a Bsa1 fragment In ttie plasmid described 
above, resulting in plasmid plCH12710. The maize ublqultin promoter was amplified by PCR 
from genomic DNA using primers Ubi1 (5' ttgcatgcct gcagt gcagc gtgacccggt c 3') and Ub,2 (5' 
gggatcctct agagtcgacc tgcagaagta acaccaaaca acagggtg 3') and cloned as a Sphl-BamHl 
fiagment together with HPT (a BamHI Xbal fragment from plCH12710) in plCH12720 (an 
intemiedlate construct containing restriction sites and an AttB site; sequence 5' tctaagctac 
tcgagactag tgcatgctgt tctagactcg aagccgcggt gcgggtgcca gggcgtgccc ttgggctccc cgggcgcgta 
ctccacctca cccatcggta ccg 3'). The resulting construct plCH12870 (Rg. 20) contains the 
hygromycin gene wltti an intron fused to the maize ublqultin promoter, followed by an AttB site. 
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Finally, the HPT gene was subcloned as a Kpn1-Spe1 fragment into plCH12960 (Fig. 
21). The resulting construct plCH12970 (Fig. 21) contains the N and C-tenninal ends of Ahas 
and Bamase fused to intein fragments as well as the HPT selection marker, two AttP sites and 
two AttB sites. 



EXAMPLE 7 

Constmctinq intearase clones 

plCH13160 (Fig. 20) was made by cloning the Streptomyces Phage C31 integrase 
(From David Ow. Plant Gene Expression Center. US Department of AgriculturB^ricultural 
Research Service. Albany. CA 94710. USA) and the Spm promoter (amplified by PCR from^ 
plC028 with primers Spmprfwd (5' cgtctagagt caaaggagtg tcagttaatt a 3') and Spmprrev (5' 
cgctgcagtg cttggcgagg ccgccc 3') in an Icon Genetics binary vector (selection In agrobacterium 
and E.coli: Kan**). 

The maize ubiquitin promoter was subcloned firom plCH12720 as a BspD1-blunt Psti 
fragment into plCH13160 (Fig. 20) digested with AscI -blunt and Psti. The resulting plasmid. 
PICH13130 (Rg. 9). contains the integrase under control of the maize ubiquitin promoter. 

EXAMPLES 

fif^nfiration of tranaaenir plants with Dre>-locus 

The PICH12970 (Rg.21). plCH13130 (Fig. 20) and plCH13160 (Fig. 20) constructs were 
transformed Into maize, rice and tobacco using Hygromycin selection. 

PICH12970 transfbrmants were sprayed with chlorsulfon (Glean. Dupont) to select 
plants that expressed the mutant split AHAS gene at a level sufficient for heit>lcide resistance 
(alternatively, construct plCH12960 (Fig. 21) can be transfomied Into plants using selection on 
chlorosulfuron or sulfbmeturon methyl, with the advantage of directly selecting transfomnants 
that express AHAS at a sufficient level). Plants that looked healthy despite the presence of ttie 
spilt Bamase gene, but that were male sterile, were analyzed by Southem blot to identify 
individuals containing a single transgene. Such transfbmiants were pollinated by plCH13130 
(Rg. 20) or plCH13160 (Rg. 20) transfonnants. The same ti^nsfomiants were also pollinated 
witii wild type plants to rescue plants with an intact non-recombined transgene locus. The F1 
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plants (PICH12970 x integrase transformants) were checked by PGR for the presence of both 
transgenes (plCH12970 transgene. see Fig. 21, and the integrase. see Fig. 20). and seeds 
were collected. The F2 seedlings were grown and screened by PGR to detect recombinants 
that lacked either the N-temiinal or the C-temiinal parts of the split Bamase and Ahas genes. 
Such plants were completely fertile. Pairs of plants containing complementary parts of the 
construct (as a result of integrase-mediated recombination) were crossed. Seedlings obtained 
from these crosses were sprayed with Ghlorsulfuron to eliminate plants that did not contain both 
parts of the constmcL All plants that were resistant to chlorsulfuron were also male sterile. 

The following methods were used for genetrating transgenic plants: 

Tobacco transformation 

The constructs were used for Agrobacferfum-medlated leaf disc transfom^ation of 
Nicotiana tabacum plants (Horsh et aL, 1985. Science, 2ZL 1229-1231) using selection on 
Hygromycin (25-100 mg/1) or sulfometuron methyl (0.5- 3.0 microM) or chloreulfuron (0.2-5.0 
microM). 

Rice transformation 

Callus cultures were induced from mature and immature embryos of rice cvs. Pusa Basmati 1 . 
Koshhikari etc. 

The culture media were based on Chu (N6) salts and vitamins (Chu et al.. Scientia Sinlca. 
18(5):659-68, 1975). 

Callus induction and propagation medium was supplemented witii 30 g/l sucrose. 600 mg/l L- 
proline. 2.0 mg/l of 2,4-D and 0.3% gelrite. 

Pro-rBgeneration medium contained N6 salts and vitamins with 30 g/l sucrose. 1 mg/l NAA. 2 
mg/l BA, 2 mg/l ABA and 0.6% gelrite. 

Regeneration medium contained N6 salts and vitamins witti 30 g/i sucrose. 0.2 mgfl NAA. 2 
mg/l BA, and 0.6% gelrite. 

Infection medium (IM) contained N6 salts and vitamins witii 2 mg/l 2,4-D, 10 g/l glucose. 60 g/l 
maltose. 50 mg/l ascorbic add. 1 g/l MES (2-N-morpholinoethanesulfonic acid) and 40 mg/l 
Acetosyringone (AS). The pH of ttie medium was adjusted to 5.2 by 1 N KOH. 
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Cocultivation medium (CM) was same as the IM (excluding ascorbic acid) and was solidified by 
adding 0.6% gelrite. 

Infection medium was filter sterilized, whereas all other media were autoclaved. AS. dissolved 
in DMSO (400 mg/ml), was added after sterilization. 

Agrobacterial cultures (strains AGL1. EHA105. LBA4404 etc.) with the appropriate 
binary plasmids were grown for 3 days at room temperature on LB2N (LB medium with 2 gfl 
NaCl and 1.5% agar) plates supplemented with the appropriate antibiotics. Bacteria were 
scraped from the plates and resuspended in the IM (1 0-20 ml) in 50-mL falcon tubes. The tubes 
^Nere fixed horizontally to a shaker platfomi and shaken at low speed for 4 to 5 h at room 
temperature. Optical density of the bacterial suspension was measured and OD600 was 
adjusted to 1.0-2.0. 

Callus pieces were incubated In the agrobacterial suspension for 20-180 min at room 
temperature, blotted on the filter paper disks and transferred to the gelrite-solidified CM with 60 
g/l maltose. After 3-6 days of cultivation on the CM the calll were washed five times by sterile 
water and transferred to the gelrite-solidified CM with 60 g/l sucrose and appropriate selective 
agent and, if needed. 150 mg/l Timentin. 

Resistant calli developed under selection were plated to the pre-regeneration medium 
with appropriate selective agent. Two weeks later the cultures were transfen«d to the 
regeneration medium with appropriate selective agent Regenerated plantlets were grown on 
half-strength N6 medium without hormones for one month before transplanting Into the soil. 

Hygromydn B. for hpt (hygromydn phosphotransferase) gene-based selecHon. was 
used at concentrations 25-100 mg/l. Selection based on the herbicldenesistent fonns of AHAS 
(Acetohydioxy ackl synthase) gene was performed on sulfometuron methyl (0.5- 3.0 microM) 
or chloreulfuron (0.2-5.0 mlcroM). 

^/lapq transfonnation 

Maize Immature embryos and callus cultures obtained from the lines A188. Hill eto. 
were transfomied essentially in the same way as rice cultures. Most of the media and 
transfomiation steps were the same. Pre-regeneration medium was not used. Regeneration 
medium contained N6 salts and vitamins. 30 g/l sucrose. 2 mg/l Zeatin and 0.05 mg/l 2.4-D. 
Silver thlosulfate was induded in the regeneration medium at concentrations 0.01-0.06 mM. 
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Claims 



1 . A process of producing a transgenic multi-cellular plants or parts thereof expressing a 
trait of interest, said trait having a controlled distribution of said trait to progeny, wherein 
said process comprises 

(I) producing a first plant or a cell thereof having in a first locus of a nuclear 
chromosome a first heterologous nucleotide sequence comprising a first 
fragment of a nucleotide sequence encoding said trait of interest, 
(il) producing a second plant or a cell thereof having in a second locus of a nuclear 
chromosome homologous to said nuclear chromosome of step (i). a second 
heterologous nucleotide sequence comprising a second fragment of the 
nucleotide sequence encoding said trait of Interest, and 
(Hi) hybridising said first and said second plant or cells tiiereof to generate progeny 
exhibiting said functional trait of Interest due to binding between a protein or 
polypeptide encoded by said first heterologous nucleotide sequence and a 
protein or polypeptide encoded by said second heterologous nucleotide 
sequence. 

2 The process of claim 1. wherein said muW-oellular plant organisms or said parts 
express two traits of Interest, a trait (1) and a trait (2). both traits having a controlled 
distribution to progeny. 

3. The process of claim 2, whereby 

(I') said first heterologous nucleotide sequence of step (i) comprises: 
a first fragment of a nucleotide sequence encoding trait (1 ) and 
a firet fragment of a nucleotide sequence encoding trait (2); and 
(li') said second heterologous nucleotide sequence of step (ii) comprises: 
a second fragment of a nucleotide sequence encoding b^it (1) and 
a second fragment of a nucleotide sequence encoding trait (2); and 
(Hi-) step (ill) comprises hybridising said first and said second plant or cells thereof to 
generate progeny exhibiting trait (1 ) and trait (2). whereby exhibiting of trait (1 ) is due to 
binding between a protein or polypeptide encoded by said first heterologous nucleotide 
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sequence and a protein or polypeptide encoded by said second heterologous 
nucleotide sequence. 

4. The process of claim 3. wherein the jsrogeny generated in step (iii') exhibits trait (2) due 
to binding between a protein or polypeptide encoded by said first heterologous 
nucleotide sequence and a protein or polypeptide encoded by said second 
heterologous nucleotide sequence. 

5. The process of one of claims 2 to 4, wherein said progeny exhibits trait (1 ) and/or trait 
(2) due to Intein-medlated trans-splicing. 

6. The process of claim 2. wherein exhibiting of trait (1 ) and/or of trait (2) Is due to RNA 
trans-splicing of an RNA expression product encoded by said first heterologous 
nucleotide sequence and an RNA expression product encoded by said second 
heterologous nucleotide sequence. 

7. The process of one of claim 2 to 6, wherein step (iii) Involves selecting progeny that 
exhibits said trait (1) and said trait (2). 

8. The process of one of claims 2 to 7. wherein trait (1 ) is a herbicide resistance. 

9. The process of clairn 8. wherein step (iii) involves selecting progeny that exhibits said 
trait (2) by applying a herbicide to said progeny, whereby said trait (1) endows 
resistance against said herbicide. 

1 0. The process of one of claims 2 to 9, wherein trait (2) Is male or female sterility. 

11. The process of one of claims 9 or 1 0. wherein step (ill) involves selecting progeny that 
exhibits male sterility as said trait (2) by applying a herbicide to said progeny, whereby 
said trait (1) endows resistance against said herbicide. 

12. The process of one of claims 1 to 1 1 . wherein two or more traits are assembled in step 
(iii) by trans-splicing. 
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13. The process of one of claims 1 to 1 2. wherein steps (i) comprises introducing said first 
heterologous nucleotide sequences into said first locus of a nuclear chromosome of a 
plant or a plant cell by site-targeted integration into a pre-engineered integration site or 
by homologous recombination. 

14. The process of one of claims 1 to 13, wherein steps (ii) comprises introducing said 
second heterologous nucleotide sequences into said second locus of a nuclear 
chromosome of a plant or a plant cell by site-targeted integration into a pre-engineered 
Integration site or by homologous recombination. 

1 5. The process of one of claims 1 to 1 4. wherein steps (1) and (il) are carried out by 

(a) introducing a parent heterologous nucleotide sequence comprising said first and 
said second heterologous nucleotide sequences into a nuclear chromosome of 
parent organisms or cells thereof, 

(b) optionally selecting organisms or cells thereof having said parent heterologous 
nucleotide sequence integrated in a desired chromosome or chromosome locus, 

(c) subsequently splitting said parent heterologous nucleotide sequence so that 
said first and said second heterologous nucleotide sequences are located on 
homologous chromosomes in different plant organisms or cells. 

16. The process of claim 1 5. wherein step (a) is carried out by homologous recombination 
or by site-targetet integration of said parent heterologous nucleotide sequence into a 
predetennined locus of a nudear chromosome. 

17. The process of claim 15 or 16. wherein step (a) or (b) are followed by producing 
organisms or cells thereof which are homozygous for said parent nucleotide sequence. 

18. The process of claim 15 or 16, wherein said organism obtained In step (a) or step (b) is 
heterozygous for said parent nucleotide sequence. 
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19. The process of one of claims 15 to 18. wherein step (c) comprises excision of said 
second heterologous nucleotide sequence from said parent heterologous nucleotide 
sequence, 

optionally followed by reintegration of said excised second heterologous nucleotide 
sequence Into a locus of a chromosome that is homologous with respect to the 
chromosome of said parent heterologous nucleotide sequence. 

20. The process of one of claims 1 5 to 1 8. wherein step (c) comprises excision of said first 
heterologous nucleotide sequence from said parent heterologous nucleotide sequence, 
optionally followed by reintegration of said excised first heterologous nucleotide 
sequence into a locus of a chromosome that is homologous with respect to the 
chromosome of said parent heterologous nucleotide sequence. 

21 . The process of one of claims 1 9 or 20. wherein the plants or cells thereof obtained In 
claim 19 or 20 or progeny thereof are analysed for said reintegration of the excised 
heterologous nucleotide sequence, 

and plants or cells thereof are selected that do not contain said excised heterologous 
nucleotide sequence or that contain said heterologous nucleotide sequence at a desired 
locus on a chromosome homologous to the chromosome harboring the heterologous 
nucleotide sequence ttiat has not been exdsed. 

22. The process of one of claims 19 to 21. wherein said first and/or said second 
heterologous nucleotide sequence In said parent heterologous nucleotide sequence 
Is/are contained In a non-autonomous transposon and said excision comprises 
providing a transposase for said transposon. 

23. The process of claim 22, wherein 

(A) said first heterologous nucleotide sequence in said parent heterologous 
nucleotide sequence Is contained In a first non-autonomous transposon and said 
second heterologous nucleotide sequence Is contained in a second non-autonomous 
transposon and 

(B) said first heterologous nucleotide sequence Is excised by providing a first 
transposase functional with said first non-autonomous transposon and said second 
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heterologous nucleotide sequence is excised by providing a second tranposase 
functional with said second non-autonomous transposon. 

24. The process of claim 23. wherein said first and said second trahsposons in said parent 
heterologous nucleotide sequence overiap such that excision of said first or said second 
heterologous nucleotide sequence leads to disaiption of said second or said first non- 
autonomous transposon, respectively. 

25. The process of one of claims 19 to 21, wherein said first heterologous nucleotide 
sequence in said parent heterologous nucleotide sequence is flanked by recombination 
sites of a first site-specific recombinase and 

wherein said second heterologous nucleotide sequence In said parent heterologous 
nucleotide sequence is flanked by recombination sites of a second site specific 
recombinase. 

26. The process of claim 25, wherein said first site-specific recombinase is dHferent from 
said second site-specific recombinase. 

27. The process of claim 25. wherein a segment 1 and a segment 2 of said parental 
heterologous nucleotide sequence overlap, whereby 

segment 1 comprises said first heterologous nucleotide sequence flanked by the 
recombination sites functional with said first site-specific recombinase and 
segment 2 comprises said second heterologous nucleotide sequence flanked by the 
recombination sites functional with said second site-specific recombinase. 

28. The process of one of claims 15 to 21, wherein said first heterologous nucleotide 
sequence In said parent heterologous nucleotide sequence Is flanked by differing 
recombination sites of a site-specific Integrase and said second heterologous nucleotide 
sequence in said parent heterologous nucleotide sequence Is flanked by differring 
recombination sites of the same site-specific integrase. and step (c) is carried out by 

providing said site-specific Integrase to said parent organism or cells ttiereof. 
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selecting progeny of said parent organism or cells thereof containing said first 
heterologous nucleotide sequence but not said second heterologous nucleotide 
sequence, and 

selecting progeny of said parent organism or cells thereof containing said 
second heterologous nucleotide sequence but not said first heterologous 
nucleotide sequence. 

29. The process of one of claims 22 to 28, wherein said transposase, said site-specific 
recombinase. and said site-specific integrase is provided by hybridising or crossing wltii 
a plant or plant cells containing a gene coding for said transposase or said recombinase 
or by Agrobacferium-mediated transfomiation. viral transfection. particle bombardment, 
electroporation or PEG-mediated transformation with a gene coding for said 
transposase or said recombinase. 



30. 



32. 



The process of one of claims 1 to 29, whereby said first and said second loci are 
selected for a reduced probability of undergoing crossing over. 



31 . The process of claim 30. wherein said first and said second loci are conesponding loci 
on said homologous chromosomes. 



The process of one of claims 1 to 31 . wherein said first and said second plant or cells 
thereof are made homozygous for said first and said second heterologous nucleotide 
sequences. 

33. The process of one of claims 1 to 32. wherein said binding of said proteins or 
polypeptides is followed by peptide bond formation between said protein or 
polypeptides. 

34. The process of claim 33. wherein said binding and said peptide bond fomiation is intein- 
mediated ta^ns-spiidng. 

35. The process of one of claims 1 to 34. wherein said controlled distribution means ttiat. 
upon crossing of said transgenic multi-cellular plant organism witti an organism devoid 
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of said first and said second heterologous sequences, the frequency of the appearance 
of said trait in descendent organisms is less than 1%, preferably, less than 0.1%, more 
preferably less than 0.01%, most preferably less than 0.001%. 

36. The process of claim 35. wherein said transgenic multi-cellular plant organism is 
incapable of expressing said trait of interest in the absence of either said first or said 
second heterologous nucleotide sequence. 

37. The process of one of claims 1 to 36. wherein said muHI-cellular plant Is further 
genetically or transiently modified for providing functions necessary for said binding 
and/or said peptide bond formation. 

38. The process of one of claims 1 to 37. wherein said first and/or said second heterologous 
nucleotide sequence contains an IntronforRNAds-spllcing of a transcription product of 
said first or said second hetrologous nucleotide sequence. 

39. The process of one of claims 1 to 38. wherein said trait of interest is Involved in male or 
female sterility. 

40. The process of one of claims 1 to 39. wherein said trait is selected firom the following 
group: herbicide resistance, insecticide resistance, selectable mariner, counter- 
selectable marker, transcription factor. DMA or RNA modifying enzymes, production of 
a protein of interest. 

41. The process of one of claims 1 to 40. wherein said multiH»llular plant organism is 
capable of producing progeny. 

42. The process of one of claims 1 to 41 . wherein said binding and/or said peptide bond 
formation generates a protein having a polypeptide linlced thereto, whereby said 
polypeptide Is selected from the following group: signalling, targeting, and membrane 
transduction polypeptide: a binding domain, a recognition or a visualisation tag. a 
purification tag. a protein cleavage sequence. 
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43. The process of one of claims 1 to 42, wherein said second and/or said first fragment of 
said gene encoding said trait of interest is operably linlced to a regulated promoter. 

44. The process of one of claims 1 to 43. wherein said first or said second fragment of said 
gene encoding said trait of interest encodes an internal ribosome entry site (IRES) 
allowing translation of a transcript of said first or second fi^gment. 

45. Plant, seed or plant cell expressing a trait of interest, obtained or obtainable according 
to one of claims 1 to 44, and products derived therefrom. 

46. Process of pnaducing hybrid seeds, comprising producing a transgenic multi-cellular 
plant according to one of claims 2 to 44. 

47. The process of producing hybrid seeds according to claim 46, further comprising 
crossing said transgenic multi-cellular plant organism with another plant that is male 
fertile. 

48. The process of producing hybrid seeds according to claim 46 or 47. wherein trait (1) is 
a herislclde resistance and trait (2) Is male sterility. 

49. The process of one of claims 46 to 48. wherein progeny seeds of plants that were 
grown from said hybrid seeds do not reach the cotyledon stage. 

50. The process of claim 49. wherein progeny seeds of plants that were grown from said 
hybrid seeds do not germinate. 

51. The process of one of claims 48 to 50. whereby said transgenic multi-cellular plant 
organism and/or said other plant contaln(s) a non-expressable seed germination control 
gene that is rendered expressable in plants grown flnom said hybrid seeds. 

52. Hybrid seeds obtained or obtainable according to the process of one of claims 46 to 51 
and plants grown therefrom. 
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53. Plants grown from the hybrid seeds of claim 52. wherein the progeny of said plants is 
non-viable. 

54. The plants according to claim 53. wherein progeny seeds of said plants do not 
germinate. 

55. The plants of daim 53 or 54. containing an Inactive seed gennination control gene that 
can be activated by expressing an activating protein. 

56. Use of the hybrid seeds or plants according to one of claims 52 to 55 for expressing a 
protein of interest, notably a phannaceutical protein. 



57. 



Plant or seed or cell thereof obtained or obtainable according to step (i) or step (ii) 
one of claims 1 to 44. 
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Which was changed from Cys to Ala. PIC5'epsp-intM was made following the same strategy as 
for PlC5'epsp-int except that primer intepr? (caa cag tat cac tct cca ggt ttt gcg gaa tat gcc etc 
agt ttt ggc ac) was used instead of primer inteprS. 

PICint-epsp3'M is similar to construct PICint-epsp3' but differ at the junction C Intein-EPSP by 
the addition of 3 C extein amino acids instead of five, the first one mutated from Cys to Ala and 
the two other native, and by the last C intein aminoacid which was changed from Asn to Ala. 
PlCint-epsp3'M was made following the same strategy as for PIClnt-epsp3' except that primer 
inteprS (ate gcc etc gac cag ata gcg gtt aaa age age ggc ggc gat cgc ccc att g) was used 
instead of primer inteprS. 

The three mutated aminoacids completely prevent intein mediated transplldng but do not 
prevent association of the N and C intein fragments ((Chen et al.. 2001 . Gene, 262. 39-48). 
plC5'epsp-intM and plCint-epspS'M were transfbmned in agrobaclerium strain GV3101 by 
electroporation. Both constructe were transfomtted in Arabldopsis thaliana (Col-0) and tobacco 
as described above. Primary transfomiants were all sensitive to glyphosate. but hybrid F1 
plants containing both constructs, either in tobacco pr Arabidopsis. exhibited glyphosate 
resistance. 



EXAMPLE 4 

Splitting thP Arahidons is AHAS aene 

The acetolaclate synthase (AHAS) gene from Arabidopsis (Genbank accession AY042819) was 
amplified from Arabidopsis genomic DNA using primers Alsl (5' taaaccatgg cggcggcaac 
aacaac 3') and Als2 (5' gactctagac cggttteatc tctcagtatt taatc cggco atctcc 3') and cloned as an 
Ncol-Xbal fragment In Icon Genetics binary vector plCBV24 (Kan^ selection in E.coli and 
Agrcbaaterium). Ser653 was mutated to Asn by PCR using primers Alsm5 (5' caggacaagt 
ctctcgtcgt atg 3'), Als4 (5' gaaagtgcca ccattcggga tcateg 3'). Als3 (5' cgatgatccc gaatggtggc ac 
3') and Als2. The amplified mutated fragment was cloned as an Nhel - Age1 fragment. A 
second aminoacid. Pro197 was mutated to Ser by PCR using primers Als1. AlsmS. Alsm6 (5' 
acgacgagag adtgtcctg tg 3") and Alsprl. The amplified mutated fragment was subcloned as a 
Sapl-Mlul fragment. 

The rice actim promoter was amplified by PGR from rice genomic DNA using primers 
Actpri (5- atgggcgcgc cagatctgca tgccggtcga ggteattcat atgcttgag 3") and Actpr2 (5' cgecatggtt 
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